gravitation6-214bbc04-e7e9-47a7-95a9-eb77e413b5ce

снартев 22

THERMODYNAMICS, HYDRODYNAMICS, ELECTRODYNAMICS, GEOMETRIC OPTICS, AND KINETIC THEORY

§22.1. THE WHY OF THIS CHAPTER

Astrophysical applications of gravitation theory are the focus of the rest of this book, except for Chapters 41-44. Each application-stars, star clusters, cosmology, collapse, black holes, gravitational waves, solar-system experiments-can be pursued by itself at an elementary level, without reference to the material in this chapter. But deep understanding of the applications requires a prior grasp of thermodynamics, hydrodynamics, electrodynamics, geometric optics, and kinetic theory, all in the context of curved spacetime. Hence, most Track-2 readers will want to probe these subjects at this point.

§22.2. THERMODYNAMICS IN CURVED SPACETIME*

Consider, for concreteness and simplicity, the equilibrium thermodynamics of a perfect fluid with fixed chemical composition ("simple perfect fluid")-for example, the gaseous interior of a collapsing supermassive star. The thermodynamic state of a fluid element, as it passes through an event P 0 P 0 P_(0)\mathscr{P}_{0}P0, can be characterized by various thermodynamic potentials, such as n , ρ , p , T , s , μ n , ρ , p , T , s , μ n,rho,p,T,s,mun, \rho, p, T, s, \mun,ρ,p,T,s,μ. The numerical value of each potential at P 0 P 0 P_(0)\mathscr{P}_{0}P0 is measured in the proper reference frame ( $ 13.6 $ 13.6 $13.6\$ 13.6$13.6 ) of an observer who moves with the fluid element-i.e., in the fluid element's "rest frame." Despite
This chapter is entirely Track 2. No earlier Track-2 material is needed as preparation for it, but Chapter 5 (stress-energy tensor) will be helpful.
§22.5 (geometric optics) is needed as preparation for Chapter 34 (singularities and global methods). The rest of the chapter is not needed as preparation for any later chapter; but it will be extremely helpful in most applications of gravitation theory (Chapters 23-40).
Thermodynamic potentials are defined in rest frame of fluid
Definitions of thermodynamic potentials
Definition of "simple fluid"
Law of baryon conservation
this use of rest frame to measure the potentials, the potentials are frame-independent functions (scalar fields). At the chosen event P 0 P 0 P_(0)\mathscr{P}_{0}P0, a given potential (e.g., n n nnn ) has a unique value n ( P 0 ) n P 0 n(P_(0))n\left(\mathscr{P}_{0}\right)n(P0); so n n nnn is a perfectly good frame-independent function.
The values of n , ρ , p , T , s , μ n , ρ , p , T , s , μ n,rho,p,T,s,mun, \rho, p, T, s, \mun,ρ,p,T,s,μ measure the following quantities in the rest frame of the fluid element:
n n nnn, baryon number density; i.e., number of baryons per unit three-dimensional volume of rest frame, with antibaryons (if any) counted negatively.
ρ ρ rho\rhoρ, density of total mass-energy; i.e., total mass-energy (including rest mass, thermal energy, compressional energy, etc.) contained in a unit three-dimensional volume of the rest frame.
p p ppp, isotropic pressure in rest frame.
T T TTT, temperature in rest frame.
s s sss, entropy per baryon in rest frame. (The entropy per unit volume is n s n s nsn sns.)
μ μ mu\muμ, chemical potential of baryons in rest frame [see equation (22.8) below].
The chemical composition of the fluid (number density of hydrogen molecules, number density of hydrogen atoms, number density of free protons and electrons, number density of photons, number density of 238 U 238 U ^(238)U{ }^{238} \mathrm{U}238U nuclei, number density of Λ Λ Lambda\LambdaΛ hyperons . . .) is assumed to be fixed uniquely by two thermodynamic variables-e.g., by the total number density of baryons n n nnn and the entropy per baryon s s sss. In this sense the fluid is a "simple fluid." Simple fluids occur whenever the chemical abundances are "frozen" (reaction rates too slow to be important on the time scales of interest; for example, in a supermassive star except during explosive burning and except at temperatures high enough for e e + e e + e^(-)-e^(+)e^{-}-e^{+}ee+pair production). Simple fluids also occur in the opposite extreme of complete chemical equilibrium (reaction rates fast enough to maintain equilibrium despite changing density and entropy; for example, in neutron stars, where high pressures speed up all reactions). When one examines nuclear burning in a nonconvecting star, or explosive nuclear burning, or pair production and neutrino energy losses at high temperatures, one must usually treat the fluid as "multicomponent." Then one introduces a number density n J n J n_(J)n_{J}nJ and a chemical potential μ J μ J mu_(J)\mu_{J}μJ for each chemical species with abundance not fixed by n n nnn and s s sss. For further details see, e.g., Zel'dovich and Novikov (1971).
The most fundamental law of thermodynamics-even more fundamental than the "first" and "second" laws-is baryon conservation. Consider a fluid element whose moving walls are attached to the fluid so that no baryons flow in or out. As the fluid element moves through spacetime, deforming along the way, its volume V V VVV changes. But the number of baryons in it must remain fixed, so
(22.1) d d τ ( n V ) = 0 (22.1) d d τ ( n V ) = 0 {:(22.1)(d)/(d tau)(nV)=0:}\begin{equation*} \frac{d}{d \tau}(n V)=0 \tag{22.1} \end{equation*}(22.1)ddτ(nV)=0
The changes in volume are produced by the flow of neighboring bits of fluid away from or toward each other-explicitly (exercise 22.1)
(22.2) d V / d τ = ( u ) V (22.2) d V / d τ = ( u ) V {:(22.2)dV//d tau=(grad*u)V:}\begin{equation*} d V / d \tau=(\boldsymbol{\nabla} \cdot \boldsymbol{u}) V \tag{22.2} \end{equation*}(22.2)dV/dτ=(u)V
where u = d / d τ u = d / d τ u=d//d tau\boldsymbol{u}=d / d \tauu=d/dτ is the 4 -velocity of the fluid. Consequently, baryon conservation [equation (22.1)] can be reexpressed as
0 = d n d τ + n V d V d τ = u n + n ( u ) = u n + n ( u ) = ( n u ) ; i.e., (22.3) S = 0 , (22.4) S = n u = baryon number-flux vector 0 = d n d τ + n V d V d τ = u n + n ( u ) = u n + n ( u ) = ( n u ) ;  i.e.,  (22.3) S = 0 , (22.4) S = n u =  baryon number-flux vector  {:[0=(dn)/(d tau)+(n)/(V)(dV)/(d tau)=grad_(u)n+n(grad*u)=u*grad n+n(grad*u)=grad*(nu);],[" i.e., "],[(22.3)qquad grad*S=0","],[(22.4)S=nu=" baryon number-flux vector "]:}\begin{align*} & 0=\frac{d n}{d \tau}+\frac{n}{V} \frac{d V}{d \tau}=\boldsymbol{\nabla}_{\boldsymbol{u}} n+n(\boldsymbol{\nabla} \cdot \boldsymbol{u})=\boldsymbol{u} \cdot \boldsymbol{\nabla} n+n(\boldsymbol{\nabla} \cdot \boldsymbol{u})=\boldsymbol{\nabla} \cdot(n \boldsymbol{u}) ; \\ & \text { i.e., } \\ & \qquad \boldsymbol{\nabla} \cdot \boldsymbol{S}=0, \tag{22.3}\\ & \boldsymbol{S}=n \boldsymbol{u}=\text { baryon number-flux vector } \tag{22.4} \end{align*}0=dndτ+nVdVdτ=un+n(u)=un+n(u)=(nu); i.e., (22.3)S=0,(22.4)S=nu= baryon number-flux vector 
(see § 5.4 § 5.4 §5.4\S 5.4§5.4 and exercise 5.3.) Moreover, this abstract geometric version of the law must be just as valid in curved spacetime as in flat (equivalence principle).
Note the analogy with the law of charge conservation, J = 0 J = 0 grad*J=0\boldsymbol{\nabla} \cdot \boldsymbol{J}=0J=0, in electrodynamics (exercise 3.16) and with the local law of energy-momentum conservation, T = 0 ( $ 85.9 T = 0 ( $ 85.9 grad*T=0($85.9\boldsymbol{\nabla} \cdot \boldsymbol{T}=0(\$ 85.9T=0($85.9 and 16.2). In a very deep sense, the forms of these three laws are dictated by the theorem of Gauss ( $ 5.9 $ 5.9 $5.9\$ 5.9$5.9, and Boxes 5.3, 5.4).
The second law of thermodynamics states that, in flat spacetime or in curved, entropy can be generated but not destroyed. Apply this law to a fluid element of volume V V VVV containing a fixed number of baryons N N NNN. The entropy it contains is
S = N s = n s V S = N s = n s V S=Ns=nsVS=N s=n s VS=Ns=nsV
Entropy may flow in and out across the faces of the fluid element ("heat flow" between neighboring fluid elements); but for simplicity assume it does not; or if it does, assume that it flows too slowly to have any significance for the problem at hand. Then the entropy in the fluid element can only increase:
d ( n s V ) / d τ 0 when negligible entropy is exchanged between neighboring fluid elements; d ( n s V ) / d τ 0       when negligible entropy is exchanged between        neighboring fluid elements;  {:[d(nsV)//d tau >= 0," when negligible entropy is exchanged between "],[," neighboring fluid elements; "]:}\begin{array}{ll} d(n s V) / d \tau \geq 0 & \text { when negligible entropy is exchanged between } \\ & \text { neighboring fluid elements; } \end{array}d(nsV)/dτ0 when negligible entropy is exchanged between  neighboring fluid elements; 
i.e. [combine with equation (22.1)]
(22.5) d s / d τ 0 (no entropy exchange). (22.5) d s / d τ 0  (no entropy exchange).  {:(22.5)ds//d tau >= 0" (no entropy exchange). ":}\begin{equation*} d s / d \tau \geq 0 \text { (no entropy exchange). } \tag{22.5} \end{equation*}(22.5)ds/dτ0 (no entropy exchange). 
So long as the fluid element remains in thermodynamic equilibrium, its entropy will actually be conserved [" = = === " in equation (22.5)]; but at a shock wave, where equilibrium is momentarily broken, the entropy will increase (conversion of "relative kinetic energy" of neighboring fluid elements into heat). [For discussions of heat flow in special and general relativity, see Exercise 22.7. For discussion of shock waves, see Taub (1948), de Hoffman and Teller (1950), Israel (1960), May and White (1967), Zel'dovich and Rayzer (1967), Lichnerowicz (1967, 1971), and Thorne (1973a).]
The first law of thermodynamics, in the proper reference frame of a fluid element, Shock waves and heat flow First law of thermodynamics is identical to the first law in flat spacetime ("principle of equivalence"); and in flat spacetime the first law is merely the law of energy conservation:
d ( energy in a volume element containing a fixed number, A , of baryons ) = p d ( volume ) + T d ( entropy ) ; d (  energy in a volume element containing   a fixed number,  A ,  of baryons  ) = p d (  volume  ) + T d (  entropy  ) ; d((" energy in a volume element containing ")/(" a fixed number, "A," of baryons "))=-pd(" volume ")+Td(" entropy ");d\binom{\text { energy in a volume element containing }}{\text { a fixed number, } A, \text { of baryons }}=-p d(\text { volume })+T d(\text { entropy }) ;d( energy in a volume element containing  a fixed number, A, of baryons )=pd( volume )+Td( entropy );
Second law of thermodynamics

i.e.,
d ( ρ A / n ) = p d ( A / n ) + T d ( A s ) d ( ρ A / n ) = p d ( A / n ) + T d ( A s ) d(rho A//n)=-pd(A//n)+Td(As)d(\rho A / n)=-p d(A / n)+T d(A s)d(ρA/n)=pd(A/n)+Td(As)
i.e.,
d ρ = ρ + p n d n + n T d s d ρ = ρ + p n d n + n T d s d rho=(rho+p)/(n)dn+nTdsd \rho=\frac{\rho+p}{n} d n+n T d sdρ=ρ+pndn+nTds
Query: what kind of a "d" appears here? For a simple fluid, the values of two potentials, e.g., n n nnn and s s sss, fix all the others uniquely; so any change in ρ ρ rho\rhoρ must be determined uniquely by the changes in n n nnn and s s sss. It matters not whether the changes are measured along the world line of a given fluid element, or in some other direction. Thus, the " d d ddd " in the first law can be interpreted as an exterior derivative
(22.6) d ρ = ρ + p n d n + n T d s ; (22.6) d ρ = ρ + p n d n + n T d s ; {:(22.6)d rho=(rho+p)/(n)dn+nTds;:}\begin{equation*} \boldsymbol{d} \rho=\frac{\rho+p}{n} \boldsymbol{d} n+n T \boldsymbol{d} s ; \tag{22.6} \end{equation*}(22.6)dρ=ρ+pndn+nTds;
and the changes along a given direction in the fluid (along a given tangent vector v v v\boldsymbol{v}v ) can be written
v ρ d ρ , v = ρ + p n d n , v + n T d s , v = ρ + p n v n + n T v s . v ρ d ρ , v = ρ + p n d n , v + n T d s , v = ρ + p n v n + n T v s . {:[grad_(v)rho-=(:d rho","v:)=(rho+p)/(n)(:dn","v:)+nT(:ds","v:)],[=(rho+p)/(n)grad_(v)n+nTgrad_(v)s.]:}\begin{aligned} \boldsymbol{\nabla}_{\boldsymbol{v}} \rho & \equiv\langle\boldsymbol{d} \rho, \boldsymbol{v}\rangle=\frac{\rho+p}{n}\langle\boldsymbol{d} n, \boldsymbol{v}\rangle+n T\langle\boldsymbol{d} s, \boldsymbol{v}\rangle \\ & =\frac{\rho+p}{n} \boldsymbol{\nabla}_{\boldsymbol{v}} n+n T \boldsymbol{\nabla}_{\boldsymbol{v}} s . \end{aligned}vρdρ,v=ρ+pndn,v+nTds,v=ρ+pnvn+nTvs.
Equation (22.6) lends itself to interpretation in two opposite senses: as a way to deduce the density of mass-energy of the medium from information about pressure (as a function of n n nnn and s s sss ) and temperature (as a function of n n nnn and s s sss ); and conversely, as a way to deduce the two functions p ( n , s ) p ( n , s ) p(n,s)p(n, s)p(n,s) and T ( n , s ) T ( n , s ) T(n,s)T(n, s)T(n,s) from the one function ρ ( n , s ) ρ ( n , s ) rho(n,s)\rho(n, s)ρ(n,s). It is natural to look at the second approach first; who does not like a strategy that makes an intellectual profit? Regarding ρ ρ rho\rhoρ as a known (or calculable) function of n n nnn and s s sss, one deduces from (22.6)
ρ + p n = ( ρ n ) s n T = ( ρ s ) n ρ + p n = ρ n s n T = ρ s n {:[(rho+p)/(n)=((del rho)/(del n))_(s)],[nT=((del rho)/(del s))_(n)]:}\begin{gathered} \frac{\rho+p}{n}=\left(\frac{\partial \rho}{\partial n}\right)_{s} \\ n T=\left(\frac{\partial \rho}{\partial s}\right)_{n} \end{gathered}ρ+pn=(ρn)snT=(ρs)n
and thence pressure and temperature individually,
(22.7a) p ( n , s ) = n ( ρ n ) s ρ (22.7b) T ( n , s ) = 1 n ( ρ s ) n (22.7a) p ( n , s ) = n ρ n s ρ (22.7b) T ( n , s ) = 1 n ρ s n {:[(22.7a)p(n","s)=n((del rho)/(del n))_(s)-rho],[(22.7b)T(n","s)=(1)/(n)((del rho)/(del s))_(n)]:}\begin{gather*} p(n, s)=n\left(\frac{\partial \rho}{\partial n}\right)_{s}-\rho \tag{22.7a}\\ T(n, s)=\frac{1}{n}\left(\frac{\partial \rho}{\partial s}\right)_{n} \tag{22.7b} \end{gather*}(22.7a)p(n,s)=n(ρn)sρ(22.7b)T(n,s)=1n(ρs)n
("two equations of state from one"). The analysis simplifies still further when the fluid, already assumed to be everywhere of the same composition, is also everywhere
endowed with the same entropy per baryon, s s sss, and is in a state of adiabatic flow (no shocks or heat conduction). Then the density ρ = ρ ( n , s ) ρ = ρ ( n , s ) rho=rho(n,s)\rho=\rho(n, s)ρ=ρ(n,s) reduces to a function of one variable out of which one derives everything ( ρ , p , μ ρ , p , μ rho,p,mu\rho, p, \muρ,p,μ ) needed for the hydrodynamics and the gravitation physics of the system (next chapter). Other choices of the "primary thermodynamic potential" are appropriate under other circumstances (see Box 22.1).
If differentiation leads from ρ ( n , s ) ρ ( n , s ) rho(n,s)\rho(n, s)ρ(n,s) to p ( n , s ) p ( n , s ) p(n,s)p(n, s)p(n,s) and T ( n , s ) T ( n , s ) T(n,s)T(n, s)T(n,s), it does not follow that one can take any two functions p ( n , s ) p ( n , s ) p(n,s)p(n, s)p(n,s) and T ( n , s ) T ( n , s ) T(n,s)T(n, s)T(n,s) and proceed "backwards" (by integration) to the "primary function", ρ ( n , s ) ρ ( n , s ) rho(n,s)\rho(n, s)ρ(n,s). To be compatible with the first law of thermodynamics (22.6), the two functions must satisfy the consistency requirement ["Maxwell relation"; equality of second partial derivatives of ρ ρ rho\rhoρ ]
Maxwell relation
(22.7c) ( p / s ) n = n 2 ( T / n ) s . (22.7c) ( p / s ) n = n 2 ( T / n ) s . {:(22.7c)(del p//del s)_(n)=n^(2)(del T//del n)_(s).:}\begin{equation*} (\partial p / \partial s)_{n}=n^{2}(\partial T / \partial n)_{s} . \tag{22.7c} \end{equation*}(22.7c)(p/s)n=n2(T/n)s.

Box 22.1 PRINCIPAL ALTERNATIVES FOR "PRIMARY THERMODYNAMIC POTENTIAL" TO DESCRIBE A FLUID

Primary thermodynamic potential and quantities on which it is most appropriately envisaged to depend
"Secondary" thermodynamic quantities obtained by differentiation of primary with or without use of
d ( ρ n ) + p d ( 1 n ) T d s = 0 d ρ n + p d 1 n T d s = 0 d((rho )/(n))+pd((1)/(n))-Tds=0d\left(\frac{\rho}{n}\right)+p d\left(\frac{1}{n}\right)-T d s=0d(ρn)+pd(1n)Tds=0
"Density"; total amount of massenergy (rest + thermal + + +cdots+\cdots+ ) per unit volume
p ( n , s ) = n ( ρ n ) s ρ p ( n , s ) = n ρ n s ρ p(n,s)=n((del rho)/(del n))_(s)-rhop(n, s)=n\left(\frac{\partial \rho}{\partial n}\right)_{s}-\rhop(n,s)=n(ρn)sρ
ρ = ρ ( n , s ) T ( n , s ) = 1 n ( ρ s ) n , ρ = ρ ( n , s ) T ( n , s ) = 1 n ρ s n , rho=rho(n,s)quad T(n,s)=(1)/(n)((del rho)/(del s))_(n),\rho=\rho(n, s) \quad T(n, s)=\frac{1}{n}\left(\frac{\partial \rho}{\partial s}\right)_{n},ρ=ρ(n,s)T(n,s)=1n(ρs)n,
"Physical free energy"
a ( n , T ) = ρ n T s a ( n , T ) = ρ n T s a(n,T)=(rho )/(n)-Tsa(n, T)=\frac{\rho}{n}-T sa(n,T)=ρnTs
p ( n , T ) = n 2 ( a n ) T s ( n , T ) = ( a T ) n ρ ( n , T ) = n T 2 [ ( a / T ) T ] n p ( n , T ) = n 2 a n T s ( n , T ) = a T n ρ ( n , T ) = n T 2 ( a / T ) T n {:[p(n","T)=n^(2)((del a)/(del n))_(T)],[s(n","T)=-((del a)/(del T))_(n)],[rho(n","T)=-nT^(2)[(del(a//T))/(del T)]_(n)]:}\begin{aligned} p(n, T) & =n^{2}\left(\frac{\partial a}{\partial n}\right)_{T} \\ s(n, T) & =-\left(\frac{\partial a}{\partial T}\right)_{n} \\ \rho(n, T) & =-n T^{2}\left[\frac{\partial(a / T)}{\partial T}\right]_{n} \end{aligned}p(n,T)=n2(an)Ts(n,T)=(aT)nρ(n,T)=nT2[(a/T)T]n
"Chemical free energy"
f ( p , T ) = ρ + p n T s f ( p , T ) = ρ + p n T s f(p,T)=(rho+p)/(n)-Tsf(p, T)=\frac{\rho+p}{n}-T sf(p,T)=ρ+pnTs
1 / n ( p , T ) = ( f / p ) T s ( p , T ) = ( f / T ) p ρ ( p , T ) = f T ( f / T ) p ( f / p ) T p 1 / n ( p , T ) = ( f / p ) T s ( p , T ) = ( f / T ) p ρ ( p , T ) = f T ( f / T ) p ( f / p ) T p {:[1//n(p","T)=(del f//del p)_(T)],[s(p","T)=-(del f//del T)_(p)],[rho(p","T)=(f-T(del f//del T)_(p))/((del f//del p)_(T))-p]:}\begin{aligned} 1 / n(p, T) & =(\partial f / \partial p)_{T} \\ s(p, T) & =-(\partial f / \partial T)_{p} \\ \rho(p, T) & =\frac{f-T(\partial f / \partial T)_{p}}{(\partial f / \partial p)_{T}}-p \end{aligned}1/n(p,T)=(f/p)Ts(p,T)=(f/T)pρ(p,T)=fT(f/T)p(f/p)Tp
"Chemical potential" ("energy to inject" expressed on a "per baryon" basis)
μ ( p , s ) = p + ρ n μ ( p , s ) = p + ρ n mu(p,s)=(p+rho)/(n)\mu(p, s)=\frac{p+\rho}{n}μ(p,s)=p+ρn
1 / n ( p , s ) = ( μ / p ) s T ( p , s ) = ( μ / s ) p ρ ( p , s ) = μ ( μ / p ) s p 1 / n ( p , s ) = ( μ / p ) s T ( p , s ) = ( μ / s ) p ρ ( p , s ) = μ ( μ / p ) s p {:[1//n(p","s)=(del mu//del p)_(s)],[T(p","s)=(del mu//del s)_(p)],[rho(p","s)=(mu)/((del mu//del p)_(s))-p]:}\begin{aligned} 1 / n(p, s) & =(\partial \mu / \partial p)_{s} \\ T(p, s) & =(\partial \mu / \partial s)_{p} \\ \rho(p, s) & =\frac{\mu}{(\partial \mu / \partial p)_{s}}-p \end{aligned}1/n(p,s)=(μ/p)sT(p,s)=(μ/s)pρ(p,s)=μ(μ/p)sp
Conditions under which convenient, appropriate, and relevant
Chemical potential equals
"injection energy" at fixed entropy per baryon and total volume
Laws of hydrodynamics for simple fluid without heat flow or viscosity:
The chemical potential μ μ mu\muμ is also a unique function of n n nnn and s s sss. It is defined as follows. (1) Take a sample of the simple fluid in a fixed thermodynamic state (fixed n n nnn and s s sss ). (2) Take, separately, a much smaller sample of the same fluid, containing δ A δ A delta A\delta AδA baryons in the same thermodynamic state as the large sample (same n n nnn and s s sss ). (3) Inject the smaller sample into the larger one, holding the volume of the large sample fixed during the injection process. (4) The total mass-energy injected,
δ M injected = ρ × ( volume of injected fluid ) = ρ ( δ A / n ) , δ M injected = ρ × (  volume of injected fluid  ) = ρ ( δ A / n ) , deltaM_(injected)=rho xx(" volume of injected fluid ")=rho(delta A//n),\delta M_{\mathrm{injected}}=\rho \times(\text { volume of injected fluid })=\rho(\delta A / n),δMinjected=ρ×( volume of injected fluid )=ρ(δA/n),
plus the work required to perform the injection
δ W injection = ( work done against pressure of large sample to open up space in it for the injected fluid ) = p ( volume of injected fluid ) = p ( δ A / n ) , δ W injection  = (  work done against pressure of large sample   to open up space in it for the injected fluid  ) = p (  volume of injected fluid  ) = p ( δ A / n ) , {:[deltaW_("injection ")=((" work done against pressure of large sample ")/(" to open up space in it for the injected fluid "))],[=p(" volume of injected fluid ")=p(delta A//n)","]:}\begin{aligned} \delta W_{\text {injection }} & =\binom{\text { work done against pressure of large sample }}{\text { to open up space in it for the injected fluid }} \\ & =p(\text { volume of injected fluid })=p(\delta A / n), \end{aligned}δWinjection =( work done against pressure of large sample  to open up space in it for the injected fluid )=p( volume of injected fluid )=p(δA/n),
is equal to μ δ A μ δ A mu delta A\mu \delta AμδA :
μ δ A = δ M injected + δ W injection = ρ + p n δ A . μ δ A = δ M injected  + δ W injection  = ρ + p n δ A . mu delta A=deltaM_("injected ")+deltaW_("injection ")=(rho+p)/(n)delta A.\mu \delta A=\delta M_{\text {injected }}+\delta W_{\text {injection }}=\frac{\rho+p}{n} \delta A .μδA=δMinjected +δWinjection =ρ+pnδA.
Stated more briefly:
(22.8) μ = ( total mass-energy required, per baryon, to "create" and inject a small additional amount of fluid into a given sample, without changing s or volume of the sample ) = ρ + p n = ( ρ n ) s . [ by first law of thermodynamics (22.6)] (22.8) μ =  total mass-energy required, per baryon, to "create" and   inject a small additional amount of fluid into a given   sample, without changing  s  or volume of the sample  = ρ + p n = ρ n s . [  by first law of thermodynamics (22.6)]  {:[(22.8)mu=([" total mass-energy required, per baryon, to "create" and "],[" inject a small additional amount of fluid into a given "],[" sample, without changing "s" or volume of the sample "])],[=(rho+p)/(n)=((del rho)/(del n))_(s).],[[" by first law of thermodynamics (22.6)] "]:}\begin{align*} \mu & =\left(\begin{array}{l} \text { total mass-energy required, per baryon, to "create" and } \\ \text { inject a small additional amount of fluid into a given } \\ \text { sample, without changing } s \text { or volume of the sample } \end{array}\right) \tag{22.8}\\ & =\frac{\rho+p}{n}=\left(\frac{\partial \rho}{\partial n}\right)_{s} . \\ & {[\text { by first law of thermodynamics (22.6)] }} \end{align*}(22.8)μ=( total mass-energy required, per baryon, to "create" and  inject a small additional amount of fluid into a given  sample, without changing s or volume of the sample )=ρ+pn=(ρn)s.[ by first law of thermodynamics (22.6)] 
All the above laws and equations of thermodynamics are the same in curved spacetime as in flat spacetime; and the same in (relativistic) flat spacetime as in classical nonrelativistic thermodynamics-except for the inclusion of rest mass, together with all other forms of mass-energy, in ρ ρ rho\rhoρ and μ μ mu\muμ. The reason is simple: the laws are all formulated as scalar equations linking thermodynamic variables that one measures in the rest frame of the fluid.

§22.3. HYDRODYNAMICS IN CURVED SPACETIME*

A simple perfect fluid flows through spacetime. It might be the Earth's atmosphere circulating in the Earth's gravitational field. It might be the gaseous interior of the Sun at rest in its own gravitational field. It might be interstellar gas accreting onto a black hole. But whatever and wherever the fluid may be, its motion will be governed by the curved-spacetime laws of thermodynamics ( $ 22.2 $ 22.2 $22.2\$ 22.2$22.2 ) plus the local
law of energy-momentum conservation, T = 0 T = 0 grad*T=0\boldsymbol{\nabla} \cdot \boldsymbol{T}=0T=0. The chief objective of this section is to reduce the equation T = 0 T = 0 grad*T=0\boldsymbol{\nabla} \cdot \boldsymbol{T}=0T=0 to usable form. The reduction will be performed in the text using abstract notation; the reader is encouraged to repeat the reduction using index notation.
The stress-energy tensor for a perfect fluid, in curved spacetime as in flat (equivalence principle!), is
(22.9) T = ( ρ + p ) u u + p g (22.9) T = ( ρ + p ) u u + p g {:(22.9)T=(rho+p)u ox u+pg:}\boldsymbol{T}=(\rho+p) \boldsymbol{u} \otimes \boldsymbol{u}+p \boldsymbol{g} \tag{22.9}(22.9)T=(ρ+p)uu+pg
(See §5.5.) Its divergence is readily calculated using the chain rule; using the compatibility relation between g g g\boldsymbol{g}g and , g = 0 , g = 0 grad,grad g=0\boldsymbol{\nabla}, \boldsymbol{\nabla} \boldsymbol{g}=0,g=0; using the identity ( p ) g = p ( p ) g = p (grad p)*g=grad p(\boldsymbol{\nabla} p) \cdot \boldsymbol{g}=\boldsymbol{\nabla} p(p)g=p (which one readily verifies in index notation); and using
0 = T = [ ( ρ + p ) u ] u + [ ( ρ + p ) u ] u + [ ( ρ + p ) u ] u + ( p ) g Q [ divergence on first slot] (22.10) = [ u ρ + u p + ( ρ + p ) u ] u + ( ρ + p ) u u + p . 0 = T = [ ( ρ + p ) u ] u + [ ( ρ + p ) u ] u + [ ( ρ + p ) u ] u + ( p ) g  Q  [ divergence on first slot]  (22.10) = u ρ + u p + ( ρ + p ) u u + ( ρ + p ) u u + p . {:[0=grad*T=[grad(rho+p)*u]u+[(rho+p)grad*u]u+[(rho+p)u]*grad u+(grad p)*g],[" Q "_(["divergence on first slot] ")],[(22.10)=[grad_(u)rho+grad_(u)p+(rho+p)grad*u]u+(rho+p)grad_(u)u+grad^(')p.]:}\begin{align*} 0 & =\boldsymbol{\nabla} \cdot \boldsymbol{T}=[\boldsymbol{\nabla}(\rho+p) \cdot \boldsymbol{u}] \boldsymbol{u}+[(\rho+p) \boldsymbol{\nabla} \cdot \boldsymbol{u}] \boldsymbol{u}+[(\rho+p) \boldsymbol{u}] \cdot \boldsymbol{\nabla} \boldsymbol{u}+(\boldsymbol{\nabla} p) \cdot \boldsymbol{g} \\ & \text { Q }_{[\text {divergence on first slot] }} \\ & =\left[\boldsymbol{\nabla}_{u} \rho+\nabla_{\boldsymbol{u}} p+(\rho+p) \boldsymbol{\nabla} \cdot \boldsymbol{u}\right] \boldsymbol{u}+(\rho+p) \boldsymbol{\nabla}_{u} \boldsymbol{u}+\mathbf{\nabla}^{\prime} p . \tag{22.10} \end{align*}0=T=[(ρ+p)u]u+[(ρ+p)u]u+[(ρ+p)u]u+(p)g Q [divergence on first slot] (22.10)=[uρ+up+(ρ+p)u]u+(ρ+p)uu+p.
The component of this equation along the 4 -velocity is especially simple (recall that u u u = 1 2 u u 2 = 0 u u u = 1 2 u u 2 = 0 u*grad_(u)u=(1)/(2)grad_(u)u^(2)=0\boldsymbol{u} \cdot \boldsymbol{\nabla}_{\boldsymbol{u}} \boldsymbol{u}=\frac{1}{2} \boldsymbol{\nabla}_{\boldsymbol{u}} \boldsymbol{u}^{2}=0uuu=12uu2=0 because u 2 1 u 2 1 u^(2)-=-1\boldsymbol{u}^{2} \equiv-1u21 ):
0 = u ( T ) = [ u ρ + u p + ( ρ + p ) u ] + u p = u ρ ( ρ + p ) u . 0 = u ( T ) = u ρ + u p + ( ρ + p ) u + u p = u ρ ( ρ + p ) u . {:[0=u*(grad*T)=-[grad_(u)rho+grad_(u)p+(rho+p)grad*u]+grad_(u)p],[=-grad_(u)rho-(rho+p)grad*u.]:}\begin{aligned} 0 & =\boldsymbol{u} \cdot(\boldsymbol{\nabla} \cdot \boldsymbol{T})=-\left[\boldsymbol{\nabla}_{\boldsymbol{u}} \rho+\boldsymbol{\nabla}_{\boldsymbol{u}} p+(\rho+p) \boldsymbol{\nabla} \cdot \boldsymbol{u}\right]+\boldsymbol{\nabla}_{\boldsymbol{u}} p \\ & =-\boldsymbol{\nabla}_{\boldsymbol{u}} \rho-(\rho+p) \boldsymbol{\nabla} \cdot \boldsymbol{u} . \end{aligned}0=u(T)=[uρ+up+(ρ+p)u]+up=uρ(ρ+p)u.
Combine this with the equation of baryon conservation (22.3); the result is
(22.11a) d ρ d τ = ( ρ + p ) n d n d τ (22.11a) d ρ d τ = ( ρ + p ) n d n d τ {:(22.11a)(d rho)/(d tau)=((rho+p))/(n)(dn)/(d tau):}\begin{equation*} \frac{d \rho}{d \tau}=\frac{(\rho+p)}{n} \frac{d n}{d \tau} \tag{22.11a} \end{equation*}(22.11a)dρdτ=(ρ+p)ndndτ
(2) Local energy conservation: adiabaticity of flow
Notice that this is identical to the first law of thermodynamics (22.6) applied along a flow line, plus the assumption that the entropy per baryon is conserved along a flow line
(22.11b) d s / d τ = 0 (22.11b) d s / d τ = 0 {:(22.11b)ds//d tau=0:}\begin{equation*} d s / d \tau=0 \tag{22.11b} \end{equation*}(22.11b)ds/dτ=0
There is no reason for surprise at this result. To insist on thermodynamic equilibrium and to demand that the entropy remain constant is to require zero exchange of heat between one element of the fluid and another. But the stress-energy tensor (22.9) recognizes that heat exchange is absent. Any heat exchange would show up as an energy flux term in T T T\boldsymbol{T}T (Ex. 22.7); but no such term is present. Consequently, when one studies local energy conservation by evaluating u ( T ) = 0 u ( T ) = 0 u*(grad*T)=0\boldsymbol{u} \cdot(\boldsymbol{\nabla} \cdot \boldsymbol{T})=0u(T)=0, the stress-energy tensor reports that no heat flow is occurring-i.e. that d s / d τ = 0 d s / d τ = 0 ds//d tau=0d s / d \tau=0ds/dτ=0.
Three components of T = 0 T = 0 grad*T=0\boldsymbol{\nabla} \cdot \boldsymbol{T}=0T=0 remain: the components orthogonal to the fluid's 4 -velocity. One can pluck them out of T = 0 T = 0 grad*T=0\boldsymbol{\nabla} \cdot \boldsymbol{T}=0T=0, leaving behind the component along u u u\boldsymbol{u}u, by use of the "projection tensor"
(22.12) P g + u u (22.12) P g + u u {:(22.12)P-=g+u ox u:}\begin{equation*} P \equiv g+u \otimes u \tag{22.12} \end{equation*}(22.12)Pg+uu

Box 22.2 THERMODYNAMICS AND HYDRODYNAMICS FOR A SIMPLE PERFECT FLUID IN CURVED SPACETIME

A. Ten Quantities Characterize the Fluid

Thermodynamic potentials all measured in rest frame
n n nnn, baryon number density
ρ ρ rho\rhoρ, density of total mass-energy
p p ppp, pressure
T T TTT, temperature
s s sss, entropy per baryon
μ μ mu\muμ, chemical potential per baryon
Four components of the fluid 4-velocity
B. Ten Equations Govern the Fluid's Motion
Two equations of state
(1) p = p ( n , s ) , T = T ( n , s ) (1) p = p ( n , s ) , T = T ( n , s ) {:(1)p=p(n","s)","quad T=T(n","s):}\begin{equation*} p=p(n, s), \quad T=T(n, s) \tag{1} \end{equation*}(1)p=p(n,s),T=T(n,s)
subject to the compatibility constraint ("Maxwell relation," which follows from first law of thermodynamics)
( p / s ) n = n 2 ( T / n ) s ( p / s ) n = n 2 ( T / n ) s (del p//del s)_(n)=n^(2)(del T//del n)_(s)(\partial p / \partial s)_{n}=n^{2}(\partial T / \partial n)_{s}(p/s)n=n2(T/n)s
First law of thermodynamics
(3) d ρ = ρ + p n d n + n T d s (3) d ρ = ρ + p n d n + n T d s {:(3)d rho=(rho+p)/(n)dn+nTds:}\begin{equation*} \boldsymbol{d} \rho=\frac{\rho+p}{n} \boldsymbol{d} n+n T \boldsymbol{d} s \tag{3} \end{equation*}(3)dρ=ρ+pndn+nTds
which can be integrated to give ρ ( n , s ) ρ ( n , s ) rho(n,s)\rho(n, s)ρ(n,s).
Equation for chemical potential
(4) μ = ( ρ + p ) / n (4) μ = ( ρ + p ) / n {:(4)mu=(rho+p)//n:}\begin{equation*} \mu=(\rho+p) / n \tag{4} \end{equation*}(4)μ=(ρ+p)/n
which can be combined with ρ ( n , s ) ρ ( n , s ) rho(n,s)\rho(n, s)ρ(n,s) and p ( n , s ) p ( n , s ) p(n,s)p(n, s)p(n,s) to give μ ( n , s ) μ ( n , s ) mu(n,s)\mu(n, s)μ(n,s).
Law of baryon conservation
(5) d n / d τ u n = n u (5) d n / d τ u n = n u {:(5)dn//d tau-=grad_(u)n=-n grad*u:}\begin{equation*} d n / d \tau \equiv \boldsymbol{\nabla}_{u} n=-n \boldsymbol{\nabla} \cdot \boldsymbol{u} \tag{5} \end{equation*}(5)dn/dτun=nu
Conservation of energy along flow lines, which (assuming no energy exchange between adjacent fluid elements) means "adiabatic flow"
d s / d τ = 0 d s / d τ = 0 ds//d tau=0d s / d \tau=0ds/dτ=0 except in shock waves, where d s / d τ > 0 d s / d τ > 0 ds//d tau > 0d s / d \tau>0ds/dτ>0.
[Shock waves are not treated in this book; see Taub (1948), de Hoffman and Teller (1950), Israel (1960), May and White (1967), Zel'dovich and Rayzer (1967); Lichnerowicz (1967, 1971); and Thorne (1973a).]
Euler equations
(7) ( ρ + p ) u u = ( g + u u ) p (7) ( ρ + p ) u u = ( g + u u ) p {:(7)(rho+p)grad_(u)u=-(g+u ox u)*grad p:}\begin{equation*} (\rho+p) \boldsymbol{\nabla}_{u} \boldsymbol{u}=-(\boldsymbol{g}+\boldsymbol{u} \otimes \boldsymbol{u}) \cdot \boldsymbol{\nabla} \boldsymbol{p} \tag{7} \end{equation*}(7)(ρ+p)uu=(g+uu)p
which determine the flow lines to which u u u\boldsymbol{u}u is tangent.
Normalization of 4-velocity
(10) u u = 1 (10) u u = 1 {:(10)u*u=-1:}\begin{equation*} \boldsymbol{u} \cdot \boldsymbol{u}=-1 \tag{10} \end{equation*}(10)uu=1
(3) Euler equation
(See exercise 22.4.) Contracting P P P\boldsymbol{P}P with T = 0 T = 0 grad*T=0\boldsymbol{\nabla} \cdot \boldsymbol{T}=0T=0 [equation (22.10)] gives
(22.13) ( ρ + p ) u u = P ( p ) [ p + ( u p ) u ] (22.13) ( ρ + p ) u u = P ( p ) p + u p u {:(22.13)(rho+p)grad_(u)u=-P*(grad p)-=-[grad p+(grad_(u)p)u]:}\begin{equation*} (\rho+p) \boldsymbol{\nabla}_{u} \boldsymbol{u}=-\boldsymbol{P} \cdot(\boldsymbol{\nabla} p) \equiv-\left[\boldsymbol{\nabla} p+\left(\boldsymbol{\nabla}_{u} p\right) \boldsymbol{u}\right] \tag{22.13} \end{equation*}(22.13)(ρ+p)uu=P(p)[p+(up)u]
This is the "Euler equation" of relativistic hydrodynamics. It has precisely the same form as the corresponding flat-spacetime Euler equation:
( inertial mass per unit volume [exercise 5.4] ) × ( 4 -acceleration of fluid ) = ( pressure gradient in the 3-surface orthogonal to 4-velocity )  inertial mass   per unit volume   [exercise 5.4]  × ( 4 -acceleration   of fluid  ) =  pressure gradient   in the 3-surface   orthogonal to 4-velocity  ([" inertial mass "],[" per unit volume "],[" [exercise 5.4] "])xx((4"-acceleration ")/(" of fluid "))=-([" pressure gradient "],[" in the 3-surface "],[" orthogonal to 4-velocity "])\left(\begin{array}{l}\text { inertial mass } \\ \text { per unit volume } \\ \text { [exercise 5.4] }\end{array}\right) \times\binom{ 4 \text {-acceleration }}{\text { of fluid }}=-\left(\begin{array}{l}\text { pressure gradient } \\ \text { in the 3-surface } \\ \text { orthogonal to 4-velocity }\end{array}\right)( inertial mass  per unit volume  [exercise 5.4] )×(4-acceleration  of fluid )=( pressure gradient  in the 3-surface  orthogonal to 4-velocity ).
The pressure gradient, not "gravity," is responsible for all deviation of flow lines from geodesics.
Box 22.2 reorganizes and summarizes the above laws of thermodynamics and hydrodynamics.

Exercise 22.1. DIVERGENCE OF FLOW LINES PRODUCES VOLUME CHANGES

EXERCISES

Derive the equation d V / d τ = ( u ) V d V / d τ = ( u ) V dV//d tau=(grad*u)Vd V / d \tau=(\nabla \cdot \boldsymbol{u}) VdV/dτ=(u)V [equation (22.2)] for the rate of change of volume of a fluid element. [Hint: Pick an event P 0 P 0 P_(0)\mathscr{P}_{0}P0, and calculate in a local Lorentz frame at P 0 P 0 P_(0)\mathscr{P}_{0}P0 which momentarily moves with the fluid ("rest frame at P 0 P 0 P_(0)\mathscr{P}_{0}P0 ").] [Solution: At events near P 0 P 0 P_(0)\mathscr{P}_{0}P0 the fluid has a very small ordinary velocity v j = d x j / d t v j = d x j / d t v^(j)=dx^(j)//dtv^{j}=d x^{j} / d tvj=dxj/dt. Consequently a cube of fluid at P 0 P 0 P_(0)\mathscr{P}_{0}P0 with edges Δ x = Δ y = Δ z = L Δ x = Δ y = Δ z = L Delta x=Delta y=Delta z=L\Delta x=\Delta y=\Delta z=LΔx=Δy=Δz=L changes its edges, after time δ t δ t delta t\delta tδt, by the amounts
δ ( Δ x ) = [ ( d x / d t ) δ t ] at "front face" [ ( d x / d t ) δ t ] at "back face" = ( v x / x ) L δ t , δ ( Δ y ) = ( v y / y ) L δ t , δ ( Δ z ) = ( v z / z ) L δ t . δ ( Δ x ) = [ ( d x / d t ) δ t ] at  "front face"  [ ( d x / d t ) δ t ] at  "back face"  = v x / x L δ t , δ ( Δ y ) = v y / y L δ t , δ ( Δ z ) = v z / z L δ t . {:[delta(Delta x)=[(dx//dt)delta t]_(at)" "front face" "-[(dx//dt)delta t]_(at)" "back face" "],[=(delv^(x)//del x)L delta t","],[delta(Delta y)=(delv^(y)//del y)L delta t","],[delta(Delta z)=(delv^(z)//del z)L delta t.]:}\begin{aligned} \delta(\Delta x) & =[(d x / d t) \delta t]_{\mathrm{at}} \text { "front face" }-[(d x / d t) \delta t]_{\mathrm{at}} \text { "back face" } \\ & =\left(\partial v^{x} / \partial x\right) L \delta t, \\ \delta(\Delta y) & =\left(\partial v^{y} / \partial y\right) L \delta t, \\ \delta(\Delta z) & =\left(\partial v^{z} / \partial z\right) L \delta t . \end{aligned}δ(Δx)=[(dx/dt)δt]at "front face" [(dx/dt)δt]at "back face" =(vx/x)Lδt,δ(Δy)=(vy/y)Lδt,δ(Δz)=(vz/z)Lδt.
The corresponding change in volume is
δ ( Δ x Δ y Δ z ) = ( v j / x j ) L 3 δ t δ ( Δ x Δ y Δ z ) = v j / x j L 3 δ t delta(Delta x Delta y Delta z)=(delv^(j)//delx^(j))L^(3)delta t\delta(\Delta x \Delta y \Delta z)=\left(\partial v^{j} / \partial x^{j}\right) L^{3} \delta tδ(ΔxΔyΔz)=(vj/xj)L3δt
so the rate of change of volume is
V / t = V ( v j / x j ) V / t = V v j / x j del V//del t=V(delv^(j)//delx^(j))\partial V / \partial t=V\left(\partial v^{j} / \partial x^{j}\right)V/t=V(vj/xj)
But in the local Lorentz rest frame at and near P 0 P 0 P_(0)\mathscr{P}_{0}P0 (where x α = 0 x α = 0 x^(alpha)=0x^{\alpha}=0xα=0 ), the metric coefficients are g μ ν = η μ ν + 0 ( | x α | 2 ) g μ ν = η μ ν + 0 x α 2 g_(mu nu)=eta_(mu nu)+0(|x^(alpha)|^(2))g_{\mu \nu}=\eta_{\mu \nu}+0\left(\left|x^{\alpha}\right|^{2}\right)gμν=ημν+0(|xα|2), and the ordinary velocity is v j = 0 ( | x α | ) v j = 0 x α v^(j)=0(|x^(alpha)|)v^{j}=0\left(\left|x^{\alpha}\right|\right)vj=0(|xα|); so
u 0 = d t d τ = d t ( g μ ν d x μ d x ν ) 1 / 2 = 1 + 0 ( | x α | 2 ) u j = d x j d τ = v j + 0 ( x α 3 ) u 0 = d t d τ = d t g μ ν d x μ d x ν 1 / 2 = 1 + 0 x α 2 u j = d x j d τ = v j + 0 x α 3 {:[u^(0)=(dt)/(d tau)=(dt)/((-g_(mu nu)dx^(mu)dx^(nu))^(1//2))=1+0(|x^(alpha)|^(2))],[u^(j)=(dx^(j))/(d tau)=v^(j)+0(∣x^(alpha∣3))]:}\begin{gathered} u^{0}=\frac{d t}{d \tau}=\frac{d t}{\left(-g_{\mu \nu} d x^{\mu} d x^{\nu}\right)^{1 / 2}}=1+0\left(\left|x^{\alpha}\right|^{2}\right) \\ u^{j}=\frac{d x^{j}}{d \tau}=v^{j}+0\left(\mid x^{\alpha \mid 3}\right) \end{gathered}u0=dtdτ=dt(gμνdxμdxν)1/2=1+0(|xα|2)uj=dxjdτ=vj+0(xα3)
Thus, the derivatives V / t V / t del V//del t\partial V / \partial tV/t and V ( v j / x j ) V v j / x j V(delv^(j)//delx^(j))V\left(\partial v^{j} / \partial x^{j}\right)V(vj/xj) at P 0 P 0 P_(0)\mathscr{P}_{0}P0 are
V / t = u α V / x α = u α V , α = d V / d τ = V ( v j / x j ) = V ( u α / x α ) = V u α ; α = V ( u ) . Q.E.D.] V / t = u α V / x α = u α V , α = d V / d τ = V v j / x j = V u α / x α = V u α ; α = V ( u ) .  Q.E.D.]  {:[del V//del t=u^(alpha)del V//delx^(alpha)=u^(alpha)V_(,alpha)=dV//d tau],[=V(delv^(j)//delx^(j))=V(delu^(alpha)//delx^(alpha))=Vu^(alpha)_(;alpha)=V(grad*u).quad" Q.E.D.] "]:}\begin{aligned} \partial V / \partial t & =u^{\alpha} \partial V / \partial x^{\alpha}=u^{\alpha} V_{, \alpha}=d V / d \tau \\ & =V\left(\partial v^{j} / \partial x^{j}\right)=V\left(\partial u^{\alpha} / \partial x^{\alpha}\right)=V u^{\alpha}{ }_{; \alpha}=V(\boldsymbol{\nabla} \cdot \boldsymbol{u}) . \quad \text { Q.E.D.] } \end{aligned}V/t=uαV/xα=uαV,α=dV/dτ=V(vj/xj)=V(uα/xα)=Vuα;α=V(u). Q.E.D.] 
[Note that by working in flat spacetime, one could have inferred more easily that V / t = V / t = del V//del t=\partial V / \partial t=V/t= d V / d τ d V / d τ dV//d taud V / d \taudV/dτ and v j / x j = u v j / x j = u delv^(j)//delx^(j)=grad*u\partial v^{j} / \partial x^{j}=\boldsymbol{\nabla} \cdot \boldsymbol{u}vj/xj=u; one would then have concluded d V / d τ = ( u ) V d V / d τ = ( u ) V dV//d tau=(grad*u)Vd V / d \tau=(\boldsymbol{\nabla} \cdot \boldsymbol{u}) VdV/dτ=(u)V; and one could have invoked the equivalence principle to move this law into curved spacetime.]

Exercise 22.2. EQUATION OF CONTINUITY

Show that in the nonrelativistic limit in flat spacetime the equation of baryon conservation (22.3) becomes the "equation of continuity"
n t + x j ( n v j ) = 0 n t + x j n v j = 0 (del n)/(del t)+(del)/(delx^(j))(nv^(j))=0\frac{\partial n}{\partial t}+\frac{\partial}{\partial x^{j}}\left(n v^{j}\right)=0nt+xj(nvj)=0

Exercise 22.3. CHEMICAL POTENTIAL FOR IDEAL FERMI GAS

Show that the chemical potential of an ideal Fermi gas, nonrelativistic or relativistic, is (at zero temperature) equal to the Fermi energy (energy of highest occupied momentum state) of that gas.

Exercise 22.4. PROJECTION TENSORS

Show that contraction of a tangent vector B B B\boldsymbol{B}B with the "projection tensor" P g + u u P g + u u P-=g+u ox u\boldsymbol{P} \equiv \boldsymbol{g}+\boldsymbol{u} \otimes \boldsymbol{u}Pg+uu projects B B B\boldsymbol{B}B into the 3 -surface orthogonal to the 4 -velocity vector u u u\boldsymbol{u}u. [Hint: perform the
calculation in an orthonormal frame with e 0 ^ = u e 0 ^ = u e_( hat(0))=u\boldsymbol{e}_{\hat{0}}=\boldsymbol{u}e0^=u, and write B = B α e α ^ B = B α e α ^ B=B^(alpha)e_( hat(alpha))\boldsymbol{B}=B^{\alpha} \boldsymbol{e}_{\hat{\alpha}}B=Bαeα^; then show that P B = B j e j . I P B = B j e j . I  P*B=B^(j)e_(j". I ")\boldsymbol{P} \cdot \boldsymbol{B}=B^{j} \boldsymbol{e}_{j \text {. I }}PB=Bjej. I . If n n n\boldsymbol{n}n is a unit spacelike vector, show that P g n n P g n n P-=g-n ox n\boldsymbol{P} \equiv \boldsymbol{g}-\boldsymbol{n} \otimes \boldsymbol{n}Pgnn is the corresponding projection operator. Note: There is no unique concept of "the projection orthogonal to a null vector." Why? [Hint: draw pictures in flat spacetime suppressing one spatial dimension.]

Exercise 22.5. PRESSURE GRADIENT IN STATIONARY GRAVITATIONAL FIELD

A perfect fluid is at rest (flow lines have x j = x j = x^(j)=x^{j}=xj= constant) in a stationary gravitational field (metric coefficients are independent of x 0 x 0 x^(0)x^{0}x0 ). Show that the pressure gradient required to "support the fluid against gravity" (i.e., to make its flow lines be x j = x j = x^(j)=x^{j}=xj= constant instead of geodesics) is
(22.14) p x 0 = 0 , p x j = ( ρ + p ) ln g 00 x j (22.14) p x 0 = 0 , p x j = ( ρ + p ) ln g 00 x j {:(22.14)(del p)/(delx^(0))=0","quad(del p)/(delx^(j))=-(rho+p)(del ln sqrt(-g_(00)))/(delx^(j)):}\begin{equation*} \frac{\partial p}{\partial x^{0}}=0, \quad \frac{\partial p}{\partial x^{j}}=-(\rho+p) \frac{\partial \ln \sqrt{-g_{00}}}{\partial x^{j}} \tag{22.14} \end{equation*}(22.14)px0=0,pxj=(ρ+p)lng00xj
Evaluate this pressure gradient in the Newtonian limit, using the coordinate system and metric coefficients of equation ( 18.15 c ).

Exercise 22.6. EXPANSION, ROTATION, AND SHEAR

Let a field of fluid 4 -velocities u ( P ) u ( P ) u(P)\boldsymbol{u}(\mathscr{P})u(P) be given.
(a) Show that u u grad u\boldsymbol{\nabla} \boldsymbol{u}u can be decomposed in the following manner:
(22.15a) u α ; β = ω α β + σ α β + 1 3 θ P α β a α u β (22.15a) u α ; β = ω α β + σ α β + 1 3 θ P α β a α u β {:(22.15a)u_(alpha;beta)=omega_(alpha beta)+sigma_(alpha beta)+(1)/(3)thetaP_(alpha beta)-a_(alpha)u_(beta):}\begin{equation*} u_{\alpha ; \beta}=\omega_{\alpha \beta}+\sigma_{\alpha \beta}+\frac{1}{3} \theta P_{\alpha \beta}-a_{\alpha} u_{\beta} \tag{22.15a} \end{equation*}(22.15a)uα;β=ωαβ+σαβ+13θPαβaαuβ
where a a a\boldsymbol{a}a is the 4-acceleration of the fluid
(22.15b) a α u α ; β u β (22.15b) a α u α ; β u β {:(22.15b)a_(alpha)-=u_(alpha;beta)u^(beta):}\begin{equation*} a_{\alpha} \equiv u_{\alpha ; \beta} u^{\beta} \tag{22.15b} \end{equation*}(22.15b)aαuα;βuβ
θ θ theta\thetaθ is the "expansion" of the fluid world lines
(22.15c) θ u = u ; α α (22.15c) θ u = u ; α α {:(22.15c)theta-=grad*u=u_(;alpha)^(alpha):}\begin{equation*} \theta \equiv \boldsymbol{\nabla} \cdot \boldsymbol{u}=u_{; \alpha}^{\alpha} \tag{22.15c} \end{equation*}(22.15c)θu=u;αα
P α β P α β P_(alpha beta)P_{\alpha \beta}Pαβ is the projection tensor
(22.15d) P α β g α β + u α u β (22.15d) P α β g α β + u α u β {:(22.15d)P_(alpha beta)-=g_(alpha beta)+u_(alpha)u_(beta):}\begin{equation*} P_{\alpha \beta} \equiv g_{\alpha \beta}+u_{\alpha} u_{\beta} \tag{22.15d} \end{equation*}(22.15d)Pαβgαβ+uαuβ
σ α β σ α β sigma_(alpha beta)\sigma_{\alpha \beta}σαβ is the shear tensor of the fluid
(22.15e) σ α β 1 2 ( u α ; μ P β μ + u β ; μ P α μ ) 1 3 θ P α β (22.15e) σ α β 1 2 u α ; μ P β μ + u β ; μ P α μ 1 3 θ P α β {:(22.15e)sigma_(alpha beta)-=(1)/(2)(u_(alpha;mu)P_(beta)^(mu)+u_(beta;mu)P_(alpha)^(mu))-(1)/(3)thetaP_(alpha beta):}\begin{equation*} \sigma_{\alpha \beta} \equiv \frac{1}{2}\left(u_{\alpha ; \mu} P_{\beta}^{\mu}+u_{\beta ; \mu} P_{\alpha}^{\mu}\right)-\frac{1}{3} \theta P_{\alpha \beta} \tag{22.15e} \end{equation*}(22.15e)σαβ12(uα;μPβμ+uβ;μPαμ)13θPαβ
and ω α β ω α β omega_(alpha beta)\omega_{\alpha \beta}ωαβ is the rotation 2-form of the fluid
(22.15f) ω α β 1 2 ( u α ; μ P β μ u β ; μ P α μ ) (22.15f) ω α β 1 2 u α ; μ P β μ u β ; μ P α μ {:(22.15f)omega_(alpha beta)-=(1)/(2)(u_(alpha;mu)P_(beta)^(mu)-u_(beta;mu)P_(alpha)^(mu)):}\begin{equation*} \omega_{\alpha \beta} \equiv \frac{1}{2}\left(u_{\alpha ; \mu} P_{\beta}^{\mu}-u_{\beta ; \mu} P_{\alpha}^{\mu}\right) \tag{22.15f} \end{equation*}(22.15f)ωαβ12(uα;μPβμuβ;μPαμ)
(b) Each of the component parts of this decomposition has a simple physical interpretation in the local rest frames of the fluid. The interpretation of the 4-acceleration a a a\boldsymbol{a}a in terms of accelerometer readings should be familiar. Exercise 22.1 showed that the expansion θ = u θ = u theta=grad*u\theta=\boldsymbol{\nabla} \cdot \boldsymbol{u}θ=u describes the rate of increase of the volume of a fluid element,
(22.15~g) θ = ( 1 / V ) ( d V / d τ ) (22.15~g) θ = ( 1 / V ) ( d V / d τ ) {:(22.15~g)theta=(1//V)(dV//d tau):}\begin{equation*} \theta=(1 / V)(d V / d \tau) \tag{22.15~g} \end{equation*}(22.15~g)θ=(1/V)(dV/dτ)
Exercise 22.4 explored the meaning and use of the projection tensor P P P\boldsymbol{P}P. Verify that in a local Lorentz frame ( g α ^ β ^ = η α β , Γ α ^ β ^ γ ^ = 0 g α ^ β ^ = η α β , Γ α ^ β ^ γ ^ = 0 g_( hat(alpha) hat(beta))=eta_(alpha beta),Gamma^( hat(alpha))_( hat(beta) hat(gamma))=0g_{\hat{\alpha} \hat{\beta}}=\eta_{\alpha \beta}, \Gamma^{\hat{\alpha}}{ }_{\hat{\beta} \hat{\gamma}}=0gα^β^=ηαβ,Γα^β^γ^=0 ) momentarily moving with the fluid ( u α ^ = δ α 0 u α ^ = δ α 0 u^( hat(alpha))=delta^(alpha)_(0)u^{\hat{\alpha}}=\delta^{\alpha}{ }_{0}uα^=δα0 ), σ α ^ β ^ σ α ^ β ^ sigma_( hat(alpha) hat(beta))\sigma_{\hat{\alpha} \hat{\beta}}σα^β^ and ω α ˙ β ^ ω α ˙ β ^ omega_(alpha^(˙) hat(beta))\omega_{\dot{\alpha} \hat{\beta}}ωα˙β^ reduce to the classical (nonrelativistic) shear and rotation of the fluid. [See, e.g., $ 82.4 $ 82.4 $82.4\$ 82.4$82.4 and 2.5 of Ellis (1971) for both classical and relativistic descriptions of shear and rotation.]

Exercise 22.7. HYDRODYNAMICS WITH VISCOSITY AND HEAT FLOW.*

(a) In § 15 § 15 §15\S 15§15 of Landau and Lifshitz (1959), one finds an analysis of viscous stresses for a classical (nonrelativistic) fluid. By carrying that analysis over directly to the local Lorentz rest frame of a relativistic fluid, and by then generalizing to frame-independent language, show that the contribution of viscosity to the stress-energy tensor is
(22.16a) T ( visc ) = 2 η σ ζ θ P (22.16a) T ( visc ) = 2 η σ ζ θ P {:(22.16a)T^((visc))=-2eta sigma-zeta theta P:}\begin{equation*} \boldsymbol{T}^{(\mathrm{visc})}=-2 \eta \boldsymbol{\sigma}-\zeta \theta \boldsymbol{P} \tag{22.16a} \end{equation*}(22.16a)T(visc)=2ησζθP
where η 0 η 0 eta >= 0\eta \geq 0η0 is the "coefficient of dynamic viscosity"; ζ 0 ζ 0 zeta >= 0\zeta \geq 0ζ0 is the "coefficient of bulk viscosity"; and σ , θ , P σ , θ , P sigma,theta,P\sigma, \theta, \boldsymbol{P}σ,θ,P are the shear, expansion, and projection tensor of the fluid.
(b) An idealized description of heat flow in a fluid introduces the heat-flux 4-vector q q q\boldsymbol{q}q with components in the local rest-frame of the fluid,
(22.16b) q 0 ^ = 0 , q j ^ = ( energy per unit time crossing unit surface perpendicular to e 3 ) (22.16b) q 0 ^ = 0 , q j ^ = (  energy per unit time crossing unit   surface perpendicular to  e 3 ) {:(22.16b)q^( hat(0))=0","quadq^( hat(j))=((" energy per unit time crossing unit ")/(" surface perpendicular to "e_(3))):}\begin{equation*} q^{\hat{0}}=0, \quad q^{\hat{j}}=\binom{\text { energy per unit time crossing unit }}{\text { surface perpendicular to } \boldsymbol{e}_{3}} \tag{22.16b} \end{equation*}(22.16b)q0^=0,qj^=( energy per unit time crossing unit  surface perpendicular to e3)
By generalizing from the fluid rest frame to frame-independent language, show that the contribution of heat flux to the stress-energy tensor is
(22.16c) T (heat) = u q + q u (22.16c) T (heat)  = u q + q u {:(22.16c)T^((heat) )=u ox q+q ox u:}\begin{equation*} \boldsymbol{T}^{\text {(heat) }}=\boldsymbol{u} \otimes \boldsymbol{q}+\boldsymbol{q} \otimes \boldsymbol{u} \tag{22.16c} \end{equation*}(22.16c)T(heat) =uq+qu
Thereby conclude that, in this idealized picture, the stress-energy tensor for a viscous fluid with heat conduction is
(22.16d) T α β = ρ u α u β + ( p ζ θ ) P α β 2 η σ α β + q α u β + u α q β (22.16d) T α β = ρ u α u β + ( p ζ θ ) P α β 2 η σ α β + q α u β + u α q β {:(22.16d)T^(alpha beta)=rhou^(alpha)u^(beta)+(p-zeta theta)P^(alpha beta)-2etasigma^(alpha beta)+q^(alpha)u^(beta)+u^(alpha)q^(beta):}\begin{equation*} T^{\alpha \beta}=\rho u^{\alpha} u^{\beta}+(p-\zeta \theta) P^{\alpha \beta}-2 \eta \sigma^{\alpha \beta}+q^{\alpha} u^{\beta}+u^{\alpha} q^{\beta} \tag{22.16d} \end{equation*}(22.16d)Tαβ=ρuαuβ+(pζθ)Pαβ2ησαβ+qαuβ+uαqβ
(c) Define the entropy 4 -vector s s s\boldsymbol{s}s by
(22.16e) s n s u + q / T (22.16e) s n s u + q / T {:(22.16e)s-=nsu+q//T:}\begin{equation*} \boldsymbol{s} \equiv n s \boldsymbol{u}+\boldsymbol{q} / T \tag{22.16e} \end{equation*}(22.16e)snsu+q/T
By calculations in the local rest-frame of the fluid, show that
s = ( rate of increase of entropy in a unit volume ) ( rate at which heat and fluid carry entropy into a unit volume ) (22.16f) = ( rate at which entropy is being generated in a unit volume ) s = (  rate of increase of entropy   in a unit volume  ) (  rate at which heat and fluid   carry entropy into a unit volume  ) (22.16f) = (  rate at which entropy is being   generated in a unit volume  ) {:[grad*s=((" rate of increase of entropy ")/(" in a unit volume "))-((" rate at which heat and fluid ")/(" carry entropy into a unit volume "))],[(22.16f)=((" rate at which entropy is being ")/(" generated in a unit volume "))]:}\begin{align*} \boldsymbol{\nabla} \cdot \boldsymbol{s} & =\binom{\text { rate of increase of entropy }}{\text { in a unit volume }}-\binom{\text { rate at which heat and fluid }}{\text { carry entropy into a unit volume }} \\ & =\binom{\text { rate at which entropy is being }}{\text { generated in a unit volume }} \tag{22.16f} \end{align*}s=( rate of increase of entropy  in a unit volume )( rate at which heat and fluid  carry entropy into a unit volume )(22.16f)=( rate at which entropy is being  generated in a unit volume )
Thereby arrive at the following form of the second law of thermodynamics:
(22.16~g) s 0 (22.16~g) s 0 {:(22.16~g)grad*s >= 0:}\begin{equation*} \boldsymbol{\nabla} \cdot s \geq 0 \tag{22.16~g} \end{equation*}(22.16~g)s0
(d) Calculate the law of local energy conservation, u T = 0 u T = 0 u*grad*T=0\boldsymbol{u} \cdot \boldsymbol{\nabla} \cdot \boldsymbol{T}=0uT=0, for a viscous fluid with heat flow. Combine with the first law of thermodynamics and with the law of baryon conservation to obtain
(22.16h) T s = ζ θ 2 + 2 η σ α β σ α β q α ( T , α / T + a α ) (22.16h) T s = ζ θ 2 + 2 η σ α β σ α β q α T , α / T + a α {:(22.16h)T grad*s=zetatheta^(2)+2etasigma_(alpha beta)sigma^(alpha beta)-q^(alpha)(T_(,alpha)//T+a_(alpha)):}\begin{equation*} T \boldsymbol{\nabla} \cdot \boldsymbol{s}=\zeta \theta^{2}+2 \eta \sigma_{\alpha \beta} \sigma^{\alpha \beta}-q^{\alpha}\left(T_{, \alpha} / T+a_{\alpha}\right) \tag{22.16h} \end{equation*}(22.16h)Ts=ζθ2+2ησαβσαβqα(T,α/T+aα)
Interpret each term of this equation as a contribution to entropy generation (example: 2 η σ α β σ α β 2 η σ α β σ α β 2etasigma_(alpha beta)sigma^(alpha beta)2 \eta \sigma_{\alpha \beta} \sigma^{\alpha \beta}2ησαβσαβ describes entropy generation by viscous heating). [Note: The term q α a α q α a α q^(alpha)a_(alpha)q^{\alpha} a_{\alpha}qαaα is relativistic in origin. It is associated with the inertia of the flowing heat.]
(e) When one takes account of the inertia of the flowing heat, one obtains the following generalization of the classical law of heat conduction:
(22.16i) q α = κ P α β ( T , β + T a β ) (22.16i) q α = κ P α β T , β + T a β {:(22.16i)q^(alpha)=-kappaP^(alpha beta)(T_(,beta)+Ta_(beta)):}\begin{equation*} q^{\alpha}=-\kappa P^{\alpha \beta}\left(T_{, \beta}+T a_{\beta}\right) \tag{22.16i} \end{equation*}(22.16i)qα=κPαβ(T,β+Taβ)
(Eckart 1940). Here κ κ kappa\kappaκ is the coefficient of thermal conductivity. Use this equation to show that, for a fluid at rest in a stationary gravitational field (Exercise 22.5),
$$
(22.16j) q 0 = 0 , q j = κ g 00 ( T g 00 ) , j (22.16j) q 0 = 0 , q j = κ g 00 T g 00 , j {:(22.16j)q_(0)=0","quadq_(j)=-(kappa)/(sqrt(-g_(00)))(Tsqrt(-g_(00)))_(,j):}\begin{equation*} q_{0}=0, \quad q_{j}=-\frac{\kappa}{\sqrt{-g_{00}}}\left(T \sqrt{-g_{00}}\right)_{, j} \tag{22.16j} \end{equation*}(22.16j)q0=0,qj=κg00(Tg00),j
[ T h u s , t h e r m a l e q u i l i b r i u m c o r r e s p o n d s n o t t o c o n s t a n t t e m p e r a t u r e , b u t t o t h e r e d s h i f t e d t e m p e r a t u r e d i s t r i b u t i o n $ T g 00 = $ c o n s t a n t ; T o l m a n ( 1934 a ) , p . 313. ] A l s o , u s e t h e i d e a l i z e d l a w o f h e a t c o n d u c t i o n ( 22.16 i ) t o r e e x p r e s s t h e r a t e o f e n t r o p y g e n e r a t i o n a s [ T h u s , t h e r m a l e q u i l i b r i u m c o r r e s p o n d s n o t t o c o n s t a n t t e m p e r a t u r e , b u t t o t h e r e d s h i f t e d t e m p e r a t u r e d i s t r i b u t i o n $ T g 00 = $ c o n s t a n t ; T o l m a n ( 1934 a ) , p . 313. ] A l s o , u s e t h e i d e a l i z e d l a w o f h e a t c o n d u c t i o n ( 22.16 i ) t o r e e x p r e s s t h e r a t e o f e n t r o p y g e n e r a t i o n a s [Thus,thermalequilibriumcorrespondsnottoconstanttemperature,buttotheredshiftedtemperaturedistribution$Tsqrt(-g_(00))=$constant;Tolman(1934 a),p.313.]Also,usetheidealizedlawofheatconduction(22.16 i)toreexpresstherateofentropygenerationas[Thus, thermal equilibrium corresponds not to constant temperature, but to the redshifted temperature distribution $T \sqrt{-g_{00}}=$ constant; Tolman (1934a), p. 313.] Also, use the idealized law of heat conduction (22.16i) to reexpress the rate of entropy generation as[Thus,thermalequilibriumcorrespondsnottoconstanttemperature,buttotheredshiftedtemperaturedistribution$Tg00=$constant;Tolman(1934a),p.313.]Also,usetheidealizedlawofheatconduction(22.16i)toreexpresstherateofentropygenerationas
(22.16k) T s = ζ θ 2 + 2 η σ α β σ α β + ( κ / T ) P α β ( T , α + T a α ) ( T , β + T a β ) 0 (22.16k) T s = ζ θ 2 + 2 η σ α β σ α β + ( κ / T ) P α β T , α + T a α T , β + T a β 0 {:[(22.16k)T grad*s=zetatheta^(2)+2etasigma_(alpha beta)sigma^(alpha beta)+(kappa//T)P^(alpha beta)(T_(,alpha)+Ta_(alpha))(T_(,beta)+Ta_(beta))],[ >= 0]:}\begin{align*} T \nabla \cdot \boldsymbol{s} & =\zeta \theta^{2}+2 \eta \sigma_{\alpha \beta} \sigma^{\alpha \beta}+(\kappa / T) P^{\alpha \beta}\left(T_{, \alpha}+T a_{\alpha}\right)\left(T_{, \beta}+T a_{\beta}\right) \tag{22.16k}\\ & \geq 0 \end{align*}(22.16k)Ts=ζθ2+2ησαβσαβ+(κ/T)Pαβ(T,α+Taα)(T,β+Taβ)0
$$
[For further details about heat flow and for discussions of the limitations of the above idealized description, see e.g., § 4.18 § 4.18 §4.18\S 4.18§4.18 of Ehlers (1971); also Marle (1969), Anderson (1970), Stewart (1971), and papers cited therein.]
Electric and magnetic fields
Maxwell equations and Lorentz force law

§22.4. ELECTRODYNAMICS IN CURVED SPACETIME

In a local Lorentz frame in the presence of gravity, an observer can measure the electric and magnetic fields E E E\boldsymbol{E}E and B B B\boldsymbol{B}B using the usual Lorentz force law for charged particles. As in special relativity, he can regard E E E\boldsymbol{E}E and B B B\boldsymbol{B}B as components of an electromagnetic field tensor,
F 0 ^ j ^ = F j 0 ^ ^ = E j ^ , F j k ^ ^ = ϵ j ^ ^ B ı ^ ; , . F 0 ^ j ^ = F j 0 ^ ^ = E j ^ , F j k ^ ^ = ϵ j ^ ^ B ı ^ ; , . F^( hat(0) hat(j))=-F^( hat(j( hat(0)))=E^( hat(j)),quadF^( hat(j( hat(k))))=epsilon^( hat(j) hat(ℓ))B^( hat(ı));,.)F^{\hat{0} \hat{j}}=-F^{\hat{j \hat{0}}=E^{\hat{j}}, \quad F^{\hat{j \hat{k}}}=\epsilon^{\hat{j} \hat{\ell}} B^{\hat{\imath}} ;, ~ . ~}F0^j^=Fj0^^=Ej^,Fjk^^=ϵj^^Bı^;, . 
he can regard the charge and current densities as components of a 4-vector J α ^ J α ^ J^( hat(alpha))J^{\hat{\alpha}}Jα^, and he can write Maxwell's equations and the Lorentz force equation in the special relativistic form,
F α ^ β ^ , β ^ = 4 π J α ^ , F α ^ β ^ , γ ^ + F β ^ γ ^ , α ^ + F γ ^ α ^ , β ^ = 0 , m a α ^ = F α ^ β ^ q u β ^ ( m = mass of particle, q = charge u α ^ = 4 -velocity, a α ^ = 4 -acceleration ) . F α ^ β ^ , β ^ = 4 π J α ^ , F α ^ β ^ , γ ^ + F β ^ γ ^ , α ^ + F γ ^ α ^ , β ^ = 0 , m a α ^ = F α ^ β ^ q u β ^ ( m =  mass of particle,  q =  charge  u α ^ = 4 -velocity,  a α ^ = 4 -acceleration  ) . {:[F^( hat(alpha) hat(beta))_(, hat(beta))=4piJ^( hat(alpha))","quadF_( hat(alpha) hat(beta), hat(gamma))+F_( hat(beta) hat(gamma), hat(alpha))+F_( hat(gamma) hat(alpha), hat(beta))=0","],[ma^( hat(alpha))=F^( hat(alpha) hat(beta))qu_( hat(beta))quad((m=" mass of particle, "q=" charge ")/(u^( hat(alpha))=4"-velocity, "a^( hat(alpha))=4"-acceleration ")).]:}\begin{gathered} F^{\hat{\alpha} \hat{\beta}}{ }_{, \hat{\beta}}=4 \pi J^{\hat{\alpha}}, \quad F_{\hat{\alpha} \hat{\beta}, \hat{\gamma}}+F_{\hat{\beta} \hat{\gamma}, \hat{\alpha}}+F_{\hat{\gamma} \hat{\alpha}, \hat{\beta}}=0, \\ m a^{\hat{\alpha}}=F^{\hat{\alpha} \hat{\beta}} q u_{\hat{\beta}} \quad\binom{m=\text { mass of particle, } q=\text { charge }}{u^{\hat{\alpha}}=4 \text {-velocity, } a^{\hat{\alpha}}=4 \text {-acceleration }} . \end{gathered}Fα^β^,β^=4πJα^,Fα^β^,γ^+Fβ^γ^,α^+Fγ^α^,β^=0,maα^=Fα^β^quβ^(m= mass of particle, q= charge uα^=4-velocity, aα^=4-acceleration ).
In any other frame these equations will have the same form, but with commas replaced by semicolons
(22.17a) F ; β α β = 4 π J α (22.17b) F α β ; γ + F β γ ; α + F γ α ; β = 0 , (22.17c) m a α = F α β q u β . (22.17a) F ; β α β = 4 π J α (22.17b) F α β ; γ + F β γ ; α + F γ α ; β = 0 , (22.17c) m a α = F α β q u β . {:[(22.17a)F_(;beta)^(alpha beta)=4piJ^(alpha)],[(22.17b)F_(alpha beta;gamma)+F_(beta gamma;alpha)+F_(gamma alpha;beta)=0","],[(22.17c)ma^(alpha)=F^(alpha beta)qu_(beta).]:}\begin{gather*} F_{; \beta}^{\alpha \beta}=4 \pi J^{\alpha} \tag{22.17a}\\ F_{\alpha \beta ; \gamma}+F_{\beta \gamma ; \alpha}+F_{\gamma \alpha ; \beta}=0, \tag{22.17b}\\ m a^{\alpha}=F^{\alpha \beta} q u_{\beta} . \tag{22.17c} \end{gather*}(22.17a)F;βαβ=4πJα(22.17b)Fαβ;γ+Fβγ;α+Fγα;β=0,(22.17c)maα=Fαβquβ.
These are the basic equations of electrodynamics in the presence of gravity. From them follows everything else. For example, as in special relativity, so also here (exercise 22.9), they imply the equation of charge conservation
(22.18a) J ; α α = 0 (22.18a) J ; α α = 0 {:(22.18a)J_(;alpha)^(alpha)=0:}\begin{equation*} J_{; \alpha}^{\alpha}=0 \tag{22.18a} \end{equation*}(22.18a)J;αα=0
and for an electromagnetic field interacting with charged matter (exercise 22.10) they imply vanishing divergence for the sum of the stress-energy tensors
(22.18b) ( T ( EM ) α β + T ( MATTER ) α β ) ; β = 0 (22.18b) T ( EM ) α β + T ( MATTER ) α β ; β = 0 {:(22.18b)(T^((EM)alpha beta)+T^((MATTER)alpha beta))_(;beta)=0:}\begin{equation*} \left(T^{(\mathrm{EM}) \alpha \beta}+T^{(\mathrm{MATTER}) \alpha \beta}\right)_{; \beta}=0 \tag{22.18b} \end{equation*}(22.18b)(T(EM)αβ+T(MATTER)αβ);β=0
As in special relativity, so also here, one can introduce a vector potential A μ A μ A^(mu)A^{\mu}Aμ. Replacing commas by semicolons in the usual special-relativistic expression for F μ ν F μ ν F^(mu nu)F^{\mu \nu}Fμν in terms of A μ A μ A^(mu)A^{\mu}Aμ, one obtains
(22.19a) F μ ν = A ν ; μ A μ ; ν (22.19a) F μ ν = A ν ; μ A μ ; ν {:(22.19a)F_(mu nu)=A_(nu;mu)-A_(mu;nu):}\begin{equation*} F_{\mu \nu}=A_{\nu ; \mu}-A_{\mu ; \nu} \tag{22.19a} \end{equation*}(22.19a)Fμν=Aν;μAμ;ν
If all is well, this equation should guarantee (as in special relativity) that the Maxwell equations (22.17b) are satisfied. Indeed, it does, as one sees in exercise 22.8. To derive the wave equation that governs the vector potential, insert expression (22.19a) into the remaining Maxwell equations (22.17a), obtaining
(22.19b) A α ; β β + A β ; α β = 4 π J α (22.19b) A α ; β β + A β ; α β = 4 π J α {:(22.19b)-A^(alpha;beta)_(beta)+A^(beta;alpha)_(beta)=4piJ^(alpha):}\begin{equation*} -A^{\alpha ; \beta}{ }_{\beta}+A^{\beta ; \alpha}{ }_{\beta}=4 \pi J^{\alpha} \tag{22.19b} \end{equation*}(22.19b)Aα;ββ+Aβ;αβ=4πJα
then commute covariant derivatives in the first term using the identity ( 16.6 c ), to obtain
( ) A α ; μ μ + A μ ; μ ; α + R α μ A μ = 4 π J α . ( ) A α ; μ μ + A μ ; μ ; α + R α μ A μ = 4 π J α . {:('")"-A^(alpha;mu)_(mu)+A^(mu)_(;mu)^(;alpha)+R^(alpha)_(mu)A^(mu)=4piJ^(alpha).:}\begin{equation*} -A^{\alpha ; \mu}{ }_{\mu}+A^{\mu}{ }_{; \mu}^{; \alpha}+R^{\alpha}{ }_{\mu} A^{\mu}=4 \pi J^{\alpha} . \tag{$\prime$} \end{equation*}()Aα;μμ+Aμ;μ;α+RαμAμ=4πJα.
Finally, adopting the standard approach of special relativity, impose the Lorentz gauge condition
(22.19c) A μ ; μ = 0 , (22.19c) A μ ; μ = 0 , {:(22.19c)A^(mu)_(;mu)=0",":}\begin{equation*} A^{\mu}{ }_{; \mu}=0, \tag{22.19c} \end{equation*}(22.19c)Aμ;μ=0,
thereby bringing the wave equation ( 22.19 b 22.19 b 22.19b^(')22.19 b^{\prime}22.19b ) into the form
(22.19d) ( Δ d R A ) α A α ; β β + R α β A β = 4 π J α . (22.19d) Δ d R A α A α ; β β + R α β A β = 4 π J α . {:(22.19d)(Delta_(dR)A)^(alpha)-=-A^(alpha;beta)_(beta)+R^(alpha)_(beta)A^(beta)=4piJ^(alpha).:}\begin{equation*} \left(\Delta_{d R} A\right)^{\alpha} \equiv-A^{\alpha ; \beta}{ }_{\beta}+R^{\alpha}{ }_{\beta} A^{\beta}=4 \pi J^{\alpha} . \tag{22.19d} \end{equation*}(22.19d)(ΔdRA)αAα;ββ+RαβAβ=4πJα.
The "de Rham vector wave operator" Δ Δ Delta\DeltaΔ which appears here is, apart from sign, a generalized d'Alambertian for vectors in curved spacetime. Mathematically it is more powerful than A α ; β ; β A α ; β ; β -A^(alpha;beta)_(;beta)-A^{\alpha ; \beta}{ }_{; \beta}Aα;β;β, and than any other operator that reduces to (minus) the d'Alambertian in special relativity. [For a discussion, see de Rham (1955).]
Although the electrodynamic equations (22.17a)-(22.19b) are all obtained from special relativity by the comma-goes-to-semicolon rule, the wave equation ( 22.19 d ) for the vector potential is not ("curvature coupling"; see Box 16.1). Nevertheless, when spacetime is flat (so R α β = 0 R α β = 0 R^(alpha)_(beta)=0R^{\alpha}{ }_{\beta}=0Rαβ=0 ), ( 22.19 d ) does reduce to the usual wave equation of special relativity.

Exercise 22.8. THE VECTOR POTENTIAL FOR ELECTRODYNAMICS

Show that in any coordinate frame the connection coefficients cancel out of both equations (22.19a) and (22.17b), so they can be written
(22.20a) F μ ν = A ν , μ A μ , v , (22.20b) F α β , γ + F β γ , α + F γ α , β = 0 . (22.20a) F μ ν = A ν , μ A μ , v , (22.20b) F α β , γ + F β γ , α + F γ α , β = 0 . {:[(22.20a)F_(mu nu)=A_(nu,mu)-A_(mu,v)","],[(22.20b)F_(alpha beta,gamma)+F_(beta gamma,alpha)+F_(gamma alpha,beta)=0.]:}\begin{gather*} F_{\mu \nu}=A_{\nu, \mu}-A_{\mu, v}, \tag{22.20a}\\ F_{\alpha \beta, \gamma}+F_{\beta \gamma, \alpha}+F_{\gamma \alpha, \beta}=0 . \tag{22.20b} \end{gather*}(22.20a)Fμν=Aν,μAμ,v,(22.20b)Fαβ,γ+Fβγ,α+Fγα,β=0.
(In the language of differential forms these equations are F = d A , d F = 0 F = d A , d F = 0 F=dA,dF=0\boldsymbol{F}=\boldsymbol{d} \boldsymbol{A}, \boldsymbol{d} \boldsymbol{F}=0F=dA,dF=0.) Then use this form of the equations to show that equation (22.19a) implies equation (22.17b), as asserted in the text.

Exercise 22.9. CHARGE CONSERVATION IN THE PRESENCE OF GRAVITY

Show that Maxwell's equations (22.17a,b) imply the equation of charge conservation (22.18a) when gravity is present, just as they do in special relativity theory. [Hints: Use the antisymmetry of F α β F α β F^(alpha beta)F^{\alpha \beta}Fαβ; and beware of the noncommutation of the covariant derivatives, which must be handled using equations (16.6). Alternatively, show that in coordinate frames, equation (22.17a) can be written as
( ) 1 | g | x β ( | g | F α β ) = 4 π J α ( ) 1 | g | x β | g | F α β = 4 π J α {:('")"(1)/(sqrt(|g|))(del)/(delx^(beta))(sqrt(|g|)F^(alpha beta))=4piJ^(alpha):}\begin{equation*} \frac{1}{\sqrt{|g|}} \frac{\partial}{\partial x^{\beta}}\left(\sqrt{|g|} F^{\alpha \beta}\right)=4 \pi J^{\alpha} \tag{$\prime$} \end{equation*}()1|g|xβ(|g|Fαβ)=4πJα
and (22.18a) as
( ) J ; α α 1 | g | x α ( | g | J α ) = 0 ( ) J ; α α 1 | g | x α | g | J α = 0 {:('")"J_(;alpha)^(alpha)-=(1)/(sqrt(|g|))(del)/(delx^(alpha))(sqrt(|g|)J^(alpha))=0:}\begin{equation*} J_{; \alpha}^{\alpha} \equiv \frac{1}{\sqrt{|g|}} \frac{\partial}{\partial x^{\alpha}}\left(\sqrt{|g|} J^{\alpha}\right)=0 \tag{$\prime$} \end{equation*}()J;αα1|g|xα(|g|Jα)=0
and carry out the demonstration in a coordinate frame.]

Exercise 22.10. INTERACTING ELECTROMAGNETIC FIELD AND CHARGED MATTER

As in special relativity, so also in the presence of gravity ("equivalence principle"), the stress-energy tensor for an electromagnetic field is
(22.21) T ( EM ) α β = 1 4 π ( F α μ F β μ 1 4 F μ ν F μ ν g α β ) . (22.21) T ( EM ) α β = 1 4 π F α μ F β μ 1 4 F μ ν F μ ν g α β . {:(22.21)T^((EM))_(alpha beta)=(1)/(4pi)(F_(alpha mu)F_(beta)^(mu)-(1)/(4)F_(mu nu)F^(mu nu)g_(alpha beta)).:}\begin{equation*} T^{(\mathrm{EM})}{ }_{\alpha \beta}=\frac{1}{4 \pi}\left(F_{\alpha \mu} F_{\beta}^{\mu}-\frac{1}{4} F_{\mu \nu} F^{\mu \nu} g_{\alpha \beta}\right) . \tag{22.21} \end{equation*}(22.21)T(EM)αβ=14π(FαμFβμ14FμνFμνgαβ).
Use Maxwell's equations ( 22.17 a , b ) ( 22.17 a , b ) (22.17a,b)(22.17 \mathrm{a}, \mathrm{b})(22.17a,b) in the presence of gravity to show that
(22.22) T ( EM ) α β : β = F α β J β . (22.22) T ( EM ) α β : β = F α β J β . {:(22.22)T^((EM)alpha beta)_(:beta)=-F^(alpha beta)J_(beta).:}\begin{equation*} T^{(\mathrm{EM}) \alpha \beta}{ }_{: \beta}=-F^{\alpha \beta} J_{\beta} . \tag{22.22} \end{equation*}(22.22)T(EM)αβ:β=FαβJβ.
But F α β J β F α β J β F^(alpha beta)J_(beta)F^{\alpha \beta} J_{\beta}FαβJβ is just the Lorentz 4-force per unit volume with which the electromagnetic field acts on the charged matter [see the Lorentz force equation (22.17c); also equation (5.43)]; i.e., it is T ( MATTER ) α β ; β T ( MATTER  ) α β ; β T^(("MATTER ")alpha beta)_(;beta)T^{(\text {MATTER }) \alpha \beta}{ }_{; \beta}T(MATTER )αβ;β. Consequently, the above equation can be rewritten in the form (22.18b) cited in the text.

§22.5. GEOMETRIC OPTICS IN CURVED SPACETIME*

Radio waves from the quasar 3 C 279 pass near the sun and get deflected by its gravitational field. Light rays emitted by newborn galaxies long ago and far away propagate through the cosmologically curved spacetime of the universe, and get focused (and redshifted) producing curvature-enlarged (but dim) images of the galaxies on the Earth's sky.
These and most other instances of the propagation of light and radio waves are subject to the laws of geometric optics. This section derives those laws, in curved spacetime, from Maxwell's equations.
The fundamental laws of geometric optics are: (1) light rays are null geodesics; (2) the polarization vector is perpendicular to the rays and is parallel-propagated along the rays; and (3) the amplitude is governed by an adiabatic invariant which, in quantum language, states that the number of photons is conserved.
The conditions under which these laws hold are defined by conditions on three lengths: (1) the typical reduced wavelength of the waves,
(22.23a) λ λ 2 π = ( "classical distance of closest approach for a photon with one unit of angular momentum"" ) , (22.23a) λ λ 2 π = (  "classical distance of closest approach for   a photon with one unit of angular momentum""  ) , {:(22.23a)lambda-=(lambda)/(2pi)=((" "classical distance of closest approach for ")/(" a photon with one unit of angular momentum"" "))",":}\begin{equation*} \lambda \equiv \frac{\lambda}{2 \pi}=\binom{\text { "classical distance of closest approach for }}{\text { a photon with one unit of angular momentum"" }}, \tag{22.23a} \end{equation*}(22.23a)λλ2π=( "classical distance of closest approach for  a photon with one unit of angular momentum"" ),
as measured in a typical local Lorentz frame (e.g., a frame at rest relative to nearby galaxies); (2) the typical length E E E\mathcal{E}E over which the amplitude, polarization, and wavelength of the waves vary, e.g., the radius of curvature of a wave front, or the length of a wave packet produced by a sudden outburst in a quasar; (3) the typical radius of curvature R R R\mathscr{R}R of the spacetime through which the waves propagate,
(22.23b) R | typical component of Riemann as measured in typical local Lorentz frame | 1 / 2 (22.23b) R  typical component of Riemann as measured   in typical local Lorentz frame  1 / 2 {:(22.23b)R-=|[" typical component of Riemann as measured "],[" in typical local Lorentz frame "]|^(-1//2):}\mathscr{R} \equiv\left|\begin{array}{l} \text { typical component of Riemann as measured } \tag{22.23b}\\ \text { in typical local Lorentz frame } \end{array}\right|^{-1 / 2}(22.23b)R| typical component of Riemann as measured  in typical local Lorentz frame |1/2
Geometric optics is valid whenever the reduced wavelength is very short compared to each of the other scales present,
(22.23c) λ E and λ Ω , (22.23c) λ E  and  λ Ω , {:(22.23c)lambda≪Equad" and "quad lambda≪Omega",":}\begin{equation*} \lambda \ll \mathcal{E} \quad \text { and } \quad \lambda \ll \Omega, \tag{22.23c} \end{equation*}(22.23c)λE and λΩ,
so that the waves can be regarded locally as plane waves propagating through spacetime of negligible curvature.
Mathematically one exploits the geometric-optics assumption, λ E λ E lambda≪E\lambda \ll \mathcal{E}λE and λ R λ R lambda≪R\lambda \ll \mathscr{R}λR, as follows. Focus attention on waves that are highly monochromatic over regions E E <= E\leq \mathcal{E}E. (More complex spectra can be analyzed by superposition, i.e., by Fourier analysis.) Split the vector potential of electromagnetic theory into a rapidly changing, real phase,
θ ( distance propagated ) / π θ (  distance propagated  ) / π theta∼(" distance propagated ")//pi\theta \sim(\text { distance propagated }) / \piθ( distance propagated )/π
and a slowly changing, complex amplitude (i.e. one with real and imaginary parts),
A = Real part of { amplitude × e i θ } X { amplitude × e i θ } . A =  Real part of   amplitude  × e i θ X  amplitude  × e i θ . A=" Real part of "{" amplitude "xxe^(i theta)}-=X{" amplitude "xxe^(i theta)}.\boldsymbol{A}=\text { Real part of }\left\{\text { amplitude } \times e^{i \theta}\right\} \equiv \mathbf{X}\left\{\text { amplitude } \times e^{i \theta}\right\} .A= Real part of { amplitude ×eiθ}X{ amplitude ×eiθ}.
Imagine holding fixed the scale of the amplitude variation, L L L\mathcal{L}L, and the scale of the spacetime curvature, R R R\mathscr{R}R, while making the reduced wavelength, λ λ lambda\lambdaλ, shorter and shorter. The phase will get larger and larger ( θ 1 / λ ) ( θ 1 / λ ) (theta prop1//lambda)(\theta \propto 1 / \lambda)(θ1/λ) at any fixed event in spacetime, but the amplitude as a function of location in spacetime can remain virtually unchanged,
Amplitude = [ dominant part, independent of λ ] + [ small corrections (deviations from geometric optics) due to finite wavelength ] =  dominant part,   independent of  λ +  small corrections (deviations from   geometric optics) due to finite wavelength  =[[" dominant part, "],[" independent of "lambda]]+[[" small corrections (deviations from "],[" geometric optics) due to finite wavelength "]]=\left[\begin{array}{l}\text { dominant part, } \\ \text { independent of } \lambda\end{array}\right]+\left[\begin{array}{l}\text { small corrections (deviations from } \\ \text { geometric optics) due to finite wavelength }\end{array}\right]=[ dominant part,  independent of λ]+[ small corrections (deviations from  geometric optics) due to finite wavelength ].
Overview of geometric optics
Conditions for validity of geometric optics
The "two-length-scale" expansion underlying geometric optics
This circumstance allows one to expand the amplitude in powers of λ λ lambda\lambdaλ :*
Amplitude = a + b c + c + . [ independent of t ]  Amplitude  = a + b c + c + .  independent   of  t {:[" Amplitude "=a uarr+b_(c)+c uarr+dots.],[[[" independent "],[" of "t]]]:}\begin{gathered} \text { Amplitude }=\underset{\uparrow}{\boldsymbol{a}}+\underset{\boldsymbol{c}}{\boldsymbol{b}}+\underset{\uparrow}{\boldsymbol{c}}+\ldots . \\ {\left[\begin{array}{c} \text { independent } \\ \text { of } t \end{array}\right]} \end{gathered} Amplitude =a+bc+c+.[ independent  of t]
[Actually, the expansion proceeds in powers of the dimensionless number
(22.24) λ / ( minimum of E and R ) λ / L (22.24) λ / (  minimum of  E  and  R ) λ / L {:(22.24)lambda//(" minimum of "E" and "R)-=lambda//L:}\begin{equation*} \lambda /(\text { minimum of } \mathcal{E} \text { and } \mathscr{R}) \equiv \lambda / L \tag{22.24} \end{equation*}(22.24)λ/( minimum of E and R)λ/L
Applied mathematicians call this a "two-length-scale expansion"; see, e.g., Cole (1968). The basic short-wavelength approximation here has a long history; see, e.g., Liouville (1837), Rayleigh (1912). Following a suggestion of Debye, it was applied to Maxwell's equations by Sommerfeld and Runge (1911). It is familiar as the WKB approximation in quantum mechanics, and has many other applications as indicated by the bibliography in Keller, Lewis, and Seckler (1956). The contribution of higher order terms is considered by Kline (1954) and Lewis (1958). See especially the book of Fröman and Fröman (1965).]
It is useful to introduce a parameter ε ε epsi\varepsilonε that keeps track of how rapidly various terms approach zero (or infinity) as λ / L λ / L lambda//L\lambda / Lλ/L approaches zero:
(22.25) A μ = x { ( a μ + ε b μ + ε 2 c μ + ) e i θ / ε } . (22.25) A μ = x a μ + ε b μ + ε 2 c μ + e i θ / ε . {:(22.25)A_(mu)=x{(a_(mu)+epsib_(mu)+epsi^(2)c_(mu)+cdots)e^(i theta//epsi)}.:}\begin{equation*} A_{\mu}=\boldsymbol{x}\left\{\left(a_{\mu}+\varepsilon b_{\mu}+\varepsilon^{2} c_{\mu}+\cdots\right) e^{i \theta / \varepsilon}\right\} . \tag{22.25} \end{equation*}(22.25)Aμ=x{(aμ+εbμ+ε2cμ+)eiθ/ε}.
Any term with a factor ε n ε n epsi^(n)\varepsilon^{n}εn in front of it varies as ( λ / L ) n ( λ / L ) n (lambda//L)^(n)(\lambda / L)^{n}(λ/L)n in the limit of very small wavelengths [ θ ( λ / L ) 1 ; c μ ( λ / L ) 2 θ ( λ / L ) 1 ; c μ ( λ / L ) 2 [theta prop(lambda//L)^(-1);c_(mu)prop(lambda//L)^(2):}\left[\theta \propto(\lambda / L)^{-1} ; c_{\mu} \propto(\lambda / L)^{2}\right.[θ(λ/L)1;cμ(λ/L)2; etc.]. By convention, ε ε epsi\varepsilonε is a dummy expansion parameter with eventual value unity; so it can be dropped from the calculations when it ceases to be useful. And by convention, all "post-geometric-optics corrections" are put into the amplitude terms b , c , b , c , b,c,dots\boldsymbol{b}, \boldsymbol{c}, \ldotsb,c,; none are put into θ θ theta\thetaθ.
Note that, while the phase θ θ theta\thetaθ is a real function of position in spacetime, the amplitude and hence the vectors a , b , c , a , b , c , a,b,c,dots\boldsymbol{a}, \boldsymbol{b}, \boldsymbol{c}, \ldotsa,b,c, are complex. For example, to describe monochromatic waves with righthand circular polarization, propagating in the z z zzz direction, one could set θ = ω ( z t ) θ = ω ( z t ) theta=omega(z-t)\theta=\omega(z-t)θ=ω(zt) and a = 1 / 2 a ( e x + i e y ) a = 1 / 2 a e x + i e y a=1//sqrt2a(e_(x)+ie_(y))\boldsymbol{a}=1 / \sqrt{2} a\left(\boldsymbol{e}_{x}+i \boldsymbol{e}_{y}\right)a=1/2a(ex+iey) with a a aaa real; so
A = { 1 2 a ( e x + i e y ) e i ω ( z t ) } = 1 2 a { cos [ ω ( z t ) ] e x sin [ ω ( z t ) ] e y } A = 1 2 a e x + i e y e i ω ( z t ) = 1 2 a cos [ ω ( z t ) ] e x sin [ ω ( z t ) ] e y A=aleph{(1)/(sqrt2)a(e_(x)+ie_(y))e^(i omega(z-t))}=(1)/(sqrt2)a{cos[omega(z-t)]e_(x)-sin[omega(z-t)]e_(y)}\boldsymbol{A}=\mathfrak{\aleph}\left\{\frac{1}{\sqrt{2}} a\left(\boldsymbol{e}_{x}+i \boldsymbol{e}_{y}\right) e^{i \omega(z-t)}\right\}=\frac{1}{\sqrt{2}} a\left\{\cos [\omega(z-t)] \boldsymbol{e}_{x}-\sin [\omega(z-t)] \boldsymbol{e}_{y}\right\}A={12a(ex+iey)eiω(zt)}=12a{cos[ω(zt)]exsin[ω(zt)]ey}
The assumed form (22.25) for the vector potential is the mathematical foundation of geometric optics. All the key equations of geometric optics result from inserting this vector potential into the source-free wave equation Δ A = 0 Δ A = 0 Delta A=0\boldsymbol{\Delta A}=0ΔA=0 [equation (22.19d)] and into the Lorentz gauge condition A = 0 A = 0 grad*A=0\boldsymbol{\nabla} \cdot \boldsymbol{A}=0A=0 [equation (22.19c)]. The resulting equations (derived below) take their simplest form only when expressed in terms of the following:
Basic concepts of geometric
optics:
The vector potential in geometric optics
(22.26a) "wave vector," k θ (22.26a)  "wave vector,"  k θ {:(22.26a)" "wave vector," "k-=grad theta:}\begin{equation*} \text { "wave vector," } \boldsymbol{k} \equiv \boldsymbol{\nabla} \theta \tag{22.26a} \end{equation*}(22.26a) "wave vector," kθ
(22.26b) "scalar amplitude," a ( a a ) 1 / 2 = ( a μ a ¯ μ ) 1 / 2 (22.26b)  "scalar amplitude,"  a ( a a ¯ ) 1 / 2 = a μ a ¯ μ 1 / 2 {:(22.26b)" "scalar amplitude," "a-=(a* bar(a))^(1//2)=(a^(mu) bar(a)_(mu))^(1//2):}\begin{equation*} \text { "scalar amplitude," } a \equiv(\boldsymbol{a} \cdot \overline{\boldsymbol{a}})^{1 / 2}=\left(a^{\mu} \bar{a}_{\mu}\right)^{1 / 2} \tag{22.26b} \end{equation*}(22.26b) "scalar amplitude," a(aa)1/2=(aμa¯μ)1/2
"polarization vector," f a / a = f a / a = f-=a//a=\boldsymbol{f} \equiv \boldsymbol{a} / a=fa/a= "unit complex vector along a a a\boldsymbol{a}a ". (22.26c)
(Here a a ¯ bar(a)\overline{\mathbf{a}}a is the complex conjugate of a a a\boldsymbol{a}a.) Light rays are defined to be the curves P ( λ ) P ( λ ) P(lambda)\mathscr{P}(\lambda)P(λ) normal to surfaces of constant phase θ θ theta\thetaθ. Since k θ k θ k-=grad theta\boldsymbol{k} \equiv \boldsymbol{\nabla} \thetakθ is the normal to these surfaces, the differential equation for a light ray is
(22.26~d) d x μ d λ = k μ ( x ) = g μ ν ( x ) θ , v ( x ) (22.26~d) d x μ d λ = k μ ( x ) = g μ ν ( x ) θ , v ( x ) {:(22.26~d)(dx^(mu))/(d lambda)=k^(mu)(x)=g^(mu nu)(x)theta_(,v)(x):}\begin{equation*} \frac{d x^{\mu}}{d \lambda}=k^{\mu}(x)=g^{\mu \nu}(x) \theta_{, v}(x) \tag{22.26~d} \end{equation*}(22.26~d)dxμdλ=kμ(x)=gμν(x)θ,v(x)
Box 22.3, appropriate for study at this point, shows the polarization vector, wave vector, surfaces of constant phase, and light rays for a propagating wave; the scalar amplitude, not shown there, merely tells the length of the vector amplitude a a a\boldsymbol{a}a. Insight into the complex polarization vector, if not familiar from electrodynamics, can be developed later in Exercise 22.12.
So much for the foundations. Now for the calculations. First insert the geometricoptics vector potential (22.25) into the Lorentz gauge condition:
(22.27) 0 = A ; μ μ = x { [ i ε k μ ( a μ + ε b μ + ) + ( a μ + ε b μ + ) ; μ ] e i θ / ε } . (22.27) 0 = A ; μ μ = x i ε k μ a μ + ε b μ + + a μ + ε b μ + ; μ e i θ / ε . {:(22.27)0=A_(;mu)^(mu)=x{[(i)/( epsi)k_(mu)(a^(mu)+epsib^(mu)+cdots)+(a^(mu)+epsib^(mu)+cdots)_(;mu)]e^(i theta//epsi)}.:}\begin{equation*} 0=A_{; \mu}^{\mu}=\boldsymbol{x}\left\{\left[\frac{i}{\varepsilon} k_{\mu}\left(a^{\mu}+\varepsilon b^{\mu}+\cdots\right)+\left(a^{\mu}+\varepsilon b^{\mu}+\cdots\right)_{; \mu}\right] e^{i \theta / \varepsilon}\right\} . \tag{22.27} \end{equation*}(22.27)0=A;μμ=x{[iεkμ(aμ+εbμ+)+(aμ+εbμ+);μ]eiθ/ε}.
The leading term (order 1 / ε 1 / ε 1//epsi1 / \varepsilon1/ε ) says
(22.28) k a = 0 ( amplitude is perpendicular to wave vector ) (22.28) k a = 0 (  amplitude is perpendicular to wave vector  ) {:(22.28)k*a=0(" amplitude is perpendicular to wave vector "):}\begin{equation*} \boldsymbol{k} \cdot \boldsymbol{a}=0(\text { amplitude is perpendicular to wave vector }) \tag{22.28} \end{equation*}(22.28)ka=0( amplitude is perpendicular to wave vector )
or, equivalently
( ) k f = 0 (polarization is perpendicular to wave vector). ( ) k f = 0  (polarization is perpendicular to wave vector).  {:('")"k*f=0" (polarization is perpendicular to wave vector). ":}\begin{equation*} \boldsymbol{k} \cdot \boldsymbol{f}=0 \text { (polarization is perpendicular to wave vector). } \tag{$\prime$} \end{equation*}()kf=0 (polarization is perpendicular to wave vector). 
The post-geometric-optics breakdown in this orthogonality condition is governed by the higher-order terms [ 0 ( 1 ) , 0 ( ε ) , 0 ( ε 2 ) , ] 0 ( 1 ) , 0 ( ε ) , 0 ε 2 , [0(1),0(epsi),0(epsi^(2)),dots]\left[0(1), 0(\varepsilon), 0\left(\varepsilon^{2}\right), \ldots\right][0(1),0(ε),0(ε2),] in the gauge condition (22.27); for example, the 0 ( 1 ) 0 ( 1 ) 0(1)0(1)0(1) terms say
k b = i a k b = i a k*b=i grad*a\boldsymbol{k} \cdot \boldsymbol{b}=i \boldsymbol{\nabla} \cdot \boldsymbol{a}kb=ia
Next insert the vector potential (22.25) into the source-free wave equation (22.19d):
0 = ( Δ d R A ) α = A α ; β β + R α β A β = { [ 1 ε 2 k β k β ( a α + ε b α + ε 2 c α + ) 2 i ε k β ( a α + ε b α + ) ; β (22.29) i ε k β ; β ( a α + ε b α + ) ( a α + ) ; β β + R α β ( a β + ) ] e i θ / ε } . 0 = Δ d R A α = A α ; β β + R α β A β = 1 ε 2 k β k β a α + ε b α + ε 2 c α + 2 i ε k β a α + ε b α + ; β (22.29) i ε k β ; β a α + ε b α + a α + ; β β + R α β a β + e i θ / ε . {:[0=(Delta_(dR)A)^(alpha)=-A^(alpha;beta)_(beta)+R^(alpha)_(beta)A^(beta)],[=aleph{[(1)/(epsi^(2))k^(beta)k_(beta)(a^(alpha)+epsib^(alpha)+epsi^(2)c^(alpha)+cdots)-2(i)/( epsi)k^(beta)(a^(alpha)+epsib^(alpha)+cdots)_(;beta):}],[(22.29){:-(i)/( epsi)k^(beta)_(;beta)(a^(alpha)+epsib^(alpha)+cdots)-(a^(alpha)+cdots)^(;beta)_(beta)+R^(alpha)_(beta)(a^(beta)+cdots)]e^(i theta//epsi)}.]:}\begin{align*} 0= & \left(\Delta_{d R} \boldsymbol{A}\right)^{\alpha}=-A^{\alpha ; \beta}{ }_{\beta}+R^{\alpha}{ }_{\beta} A^{\beta} \\ = & \mathfrak{\aleph}\left\{\left[\frac{1}{\varepsilon^{2}} k^{\beta} k_{\beta}\left(a^{\alpha}+\varepsilon b^{\alpha}+\varepsilon^{2} c^{\alpha}+\cdots\right)-2 \frac{i}{\varepsilon} k^{\beta}\left(a^{\alpha}+\varepsilon b^{\alpha}+\cdots\right)_{; \beta}\right.\right. \\ & \left.\left.-\frac{i}{\varepsilon} k^{\beta}{ }_{; \beta}\left(a^{\alpha}+\varepsilon b^{\alpha}+\cdots\right)-\left(a^{\alpha}+\cdots\right)^{; \beta}{ }_{\beta}+R^{\alpha}{ }_{\beta}\left(a^{\beta}+\cdots\right)\right] e^{i \theta / \varepsilon}\right\} . \tag{22.29} \end{align*}0=(ΔdRA)α=Aα;ββ+RαβAβ={[1ε2kβkβ(aα+εbα+ε2cα+)2iεkβ(aα+εbα+);β(22.29)iεkβ;β(aα+εbα+)(aα+);ββ+Rαβ(aβ+)]eiθ/ε}.
Collect terms of order 1 / ε 2 1 / ε 2 1//epsi^(2)1 / \varepsilon^{2}1/ε2 and 1 / ε 1 / ε 1//epsi1 / \varepsilon1/ε (terms of order higher than 1 / ε 1 / ε 1//epsi1 / \varepsilon1/ε govern post-geometric-optics corrections):
Box 22.3 GEOMETRY OF AN ELECTROMAGNETIC WAVE TRAIN
The drawing shows surfaces of constant phase, θ = θ = theta=\theta=θ= constant, emerging through the "surface of simultaneity", t = 0 t = 0 t=0t=0t=0, of a local Lorentz frame. The surfaces shown are alternately "crests" ( θ = 1764 π , θ = 1766 π , ) ( θ = 1764 π , θ = 1766 π , ) (theta=1764 pi,theta=1766 pi,dots)(\theta=1764 \pi, \theta=1766 \pi, \ldots)(θ=1764π,θ=1766π,) and "troughs" ( θ = 1765 π , θ = ( θ = 1765 π , θ = (theta=1765 pi,theta=(\theta=1765 \pi, \theta=(θ=1765π,θ= 1767 π , 1767 π , 1767 pi,dots1767 \pi, \ldots1767π, ) of the wave train. These surfaces make up a 1 -form, k ~ = d θ k ~ = d θ tilde(k)=d theta\tilde{\boldsymbol{k}}=\boldsymbol{d} \thetak~=dθ. The "corresponding vector" k = θ k = θ k=grad theta\boldsymbol{k}=\boldsymbol{\nabla} \thetak=θ is the "wave vector." The wave vector is null, k k = 0 k k = 0 k*k=0\boldsymbol{k} \cdot \boldsymbol{k}=0kk=0, according to Maxwell's equations [equation (22.30)]. Therefore it lies in a surface of constant phase:
( number of surfaces pierced by k ) = d θ , k = κ ~ , k = k k = 0 . (  number of surfaces   pierced by  k ) = d θ , k = κ ~ , k = k k = 0 . ((" number of surfaces ")/(" pierced by "k))=(:d theta,k:)=(: widetilde(kappa),k:)=k*k=0.\binom{\text { number of surfaces }}{\text { pierced by } \boldsymbol{k}}=\langle\boldsymbol{d} \theta, \boldsymbol{k}\rangle=\langle\widetilde{\boldsymbol{\kappa}}, \boldsymbol{k}\rangle=\boldsymbol{k} \cdot \boldsymbol{k}=0 .( number of surfaces  pierced by k)=dθ,k=κ~,k=kk=0.
But not only does it lie in a surface of constant phase; it is also perpendicular to that surface! Any vector v v v\boldsymbol{v}v in that surface must satisfy k v = k ~ , v = d θ , v = 0 k v = k ~ , v = d θ , v = 0 k*v=(: widetilde(k),v:)=(:d theta,v:)=0\boldsymbol{k} \cdot \boldsymbol{v}=\langle\widetilde{\boldsymbol{k}}, \boldsymbol{v}\rangle=\langle\boldsymbol{d} \theta, \boldsymbol{v}\rangle=0kv=k~,v=dθ,v=0 because it pierces no surfaces.
Geometric optics assumes that the reduced wavelength λ λ lambda\lambdaλ, as measured in a typical local Lorentz frame, is small compared to the scale L L L\mathcal{L}L of inhomogeneities in the wave train and small compared to the radius of curvature of spacetime, R R R\mathscr{R}R. Thus, over regions much larger than λ λ lambda\lambdaλ but smaller than E E E\mathcal{E}E or R R R\mathscr{R}R, the waves are plane-fronted
and monochromatic, and there exist Lorentz reference frames (Riemann normal coordinates). In one of these "extended" local Lorentz frames, the phase must be
θ = k α x α + constant ; θ = k α x α +  constant  ; theta=k_(alpha)x^(alpha)+" constant ";\theta=k_{\alpha} x^{\alpha}+\text { constant } ;θ=kαxα+ constant ;
no other expression will yield θ = k θ = k grad theta=k\boldsymbol{\nabla} \theta=\boldsymbol{k}θ=k. The corresponding vector potential [equation (22.25)] will be
A μ = { a μ exp [ i ( k x k 0 t ) ] } + ( "post-geometric-optics corrections"); A μ = a μ exp i k x k 0 t + (  "post-geometric-optics corrections");  A^(mu)=aleph{a^(mu)exp[i(k*x-k^(0)t)]}+(" "post-geometric-optics corrections"); "A^{\mu}=\boldsymbol{\aleph}\left\{a^{\mu} \exp \left[i\left(\boldsymbol{k} \cdot \boldsymbol{x}-k^{0} t\right)\right]\right\}+(\text { "post-geometric-optics corrections"); }Aμ={aμexp[i(kxk0t)]}+( "post-geometric-optics corrections"); 
hence,
k 0 = 2 π / ( period of wave ) = 2 π ν = ω (angular frequency ) , | k | = 2 π / ( wavelength of wave ) = 1 / λ = ω k 0 = 2 π / (  period of wave  ) = 2 π ν = ω  (angular frequency  , | k | = 2 π / (  wavelength of wave  ) = 1 / λ = ω {:[{:k^(0)=2pi//(" period of wave ")=2pi nu=omega-=" (angular frequency ")","],[|k|=2pi//(" wavelength of wave ")=1//lambda=omega]:}\begin{gathered} \left.k^{0}=2 \pi /(\text { period of wave })=2 \pi \nu=\omega \equiv \text { (angular frequency }\right), \\ |\boldsymbol{k}|=2 \pi /(\text { wavelength of wave })=1 / \lambda=\omega \end{gathered}k0=2π/( period of wave )=2πν=ω (angular frequency ),|k|=2π/( wavelength of wave )=1/λ=ω
k k k\boldsymbol{k}k points along direction of propagation of wave.
At each event in spacetime there is a wave vector; and these wave vectors, tacked end-on-end, form a family of curves-the "light rays" or simply "rays"-whose tangent vector is k k k\boldsymbol{k}k. The rays, like their tangent vector, lie both in and perpendicular to the surfaces of constant phase.
The affine parameter λ λ lambda\lambdaλ of a ray (not to be confused with wavelength = 2 π λ = 2 π λ =2pi lambda=2 \pi \lambda=2πλ ) satisfies k = d / d λ k = d / d λ k=d//d lambda\boldsymbol{k}=d / d \lambdak=d/dλ; therefore it is given by
λ = t / k 0 + constant = t / ω + constant λ = t / k 0 +  constant  = t / ω +  constant  lambda=t//k^(0)+" constant "=t//omega+" constant "\lambda=t / k^{0}+\text { constant }=t / \omega+\text { constant }λ=t/k0+ constant =t/ω+ constant 
where t t ttt is proper time along the ray as measured, not by the ray itself (its proper time is zero!), but by the local Lorentz observer who sees angular frequency ω ω omega\omegaω. Thus, while ω ω omega\omegaω is a frame-dependent quantity and t t ttt is also a frame-dependent quantity, their quotient t / ω t / ω t//omegat / \omegat/ω when measured along the ray (not off the ray) is the frame-independent affine parameter. For a particle it is possible and natural to identify the affine parameter λ λ lambda\lambdaλ with proper time τ τ tau\tauτ. For a light ray this identification is unnatural and impossible. The lapse of proper time along the ray is identically zero. The springing up of λ λ lambda\lambdaλ to take the place of the vanished τ τ tau\tauτ gives one a tool to do what one might not have suspected to be possible. Given a light ray shot out at event a a a\mathscr{a}a and passing through event B B B\mathscr{B}B, one can give a third event C C C\mathcal{C}C along the same null world line that is twice as "far" from a a aaa as B B B\mathscr{B}B is "far," in a new sense of "far" that has nothing whatever directly to do with proper time (zero!), but is defined by equal increments of the affine parameter ( λ e λ g h = λ g h λ d ) λ e λ g h = λ g h λ d (lambda_(e)-lambda_(gh)=lambda_(gh)-lambda_(d))\left(\lambda_{e}-\lambda_{g h}=\lambda_{g h}-\lambda_{d}\right)(λeλgh=λghλd). The "affine parameter" has a meaning for any null geodesic analyzed even in isolation. In this respect, it is to be distinguished from the so-called "luminosity distance" which is sometimes introduced in dealing with the propagation of radiation through curved spacetime, and which is defined by the spreading apart of two or more light rays coming from a common source.
Maxwell's equations as explored in the text [equation (22.28')] guarantee that the complex polarization vector f f f\boldsymbol{f}f is perpendicular to the wave vector k k k\boldsymbol{k}k and that, therefore, it lies in a surface of constant phase (see drawing). Intuition into the polarization vector is developed in exercise 22.12.
0 ( 1 ε 2 ) : k β k β a α = 0 (22.30) k k = 0 (wave vector is null); 0 ( 1 ε ) : k β k β b α 2 i ( k β a α ; β + 1 2 k β ; β a α ) = 0 ζ = 0 ] (22.31) k a = 1 2 ( k ) a (propagation equation for vector amplitude). 0 1 ε 2 : k β k β a α = 0 (22.30) k k = 0  (wave vector is null);  0 1 ε : k β k β b α 2 i k β a α ; β + 1 2 k β ; β a α = 0 ζ = 0 ] (22.31) k a = 1 2 ( k ) a  (propagation equation for vector amplitude).  {:[0((1)/(epsi^(2))):quadk^(beta)k_(beta)a^(alpha)=0],[(22.30) Longrightarrow k*k=0" (wave vector is null); "],[0((1)/(epsi)):quadubrace(k^(beta)k_(beta)b^(alpha)-2i(k^(beta)a^(alpha)_(;beta)+(1)/(2)k^(beta)_(;beta)a^(alpha))=0ubrace)_(zeta=0])],[(22.31) Longrightarrowgrad_(k)a=-(1)/(2)(grad*k)a" (propagation equation for vector amplitude). "]:}\begin{align*} & 0\left(\frac{1}{\varepsilon^{2}}\right): \quad k^{\beta} k_{\beta} a^{\alpha}=0 \\ & \Longrightarrow \boldsymbol{k} \cdot \boldsymbol{k}=0 \text { (wave vector is null); } \tag{22.30}\\ & 0\left(\frac{1}{\varepsilon}\right): \quad \underbrace{k^{\beta} k_{\beta} b^{\alpha}-2 i\left(k^{\beta} a^{\alpha}{ }_{; \beta}+\frac{1}{2} k^{\beta}{ }_{; \beta} a^{\alpha}\right)=0}_{\zeta=0]} \\ & \Longrightarrow \boldsymbol{\nabla}_{\boldsymbol{k}} \boldsymbol{a}=-\frac{1}{2}(\boldsymbol{\nabla} \cdot \boldsymbol{k}) \boldsymbol{a} \text { (propagation equation for vector amplitude). } \tag{22.31} \end{align*}0(1ε2):kβkβaα=0(22.30)kk=0 (wave vector is null); 0(1ε):kβkβbα2i(kβaα;β+12kβ;βaα)=0ζ=0](22.31)ka=12(k)a (propagation equation for vector amplitude). 
These equations ( 22.30 , 22.31 ) ( 22.30 , 22.31 ) (22.30,22.31)(22.30,22.31)(22.30,22.31) together with equation (22.28) are the basis from which all subsequent results will follow. As a first consequence, one can obtain the geodesic law from equation (22.30). Form the gradient of k k = 0 k k = 0 k*k=0\boldsymbol{k} \cdot \boldsymbol{k}=0kk=0,
0 = ( k β k β ) ; α = 2 k β k β ; α 0 = k β k β ; α = 2 k β k β ; α 0=(k^(beta)k_(beta))_(;alpha)=2k^(beta)k_(beta;alpha)0=\left(k^{\beta} k_{\beta}\right)_{; \alpha}=2 k^{\beta} k_{\beta ; \alpha}0=(kβkβ);α=2kβkβ;α
and use the fact that k β θ , β k β θ , β k_(beta)-=theta_(,beta)k_{\beta} \equiv \theta_{, \beta}kβθ,β is the gradient of a scalar to interchange indices, θ ; β α = θ ; α β θ ; β α = θ ; α β theta_(;beta alpha)=theta_(;alpha beta)\theta_{; \beta \alpha}=\theta_{; \alpha \beta}θ;βα=θ;αβ or
0 = k β k β ; α = k β k α ; β . 0 = k β k β ; α = k β k α ; β . 0=k^(beta)k_(beta;alpha)=k^(beta)k_(alpha;beta).0=k^{\beta} k_{\beta ; \alpha}=k^{\beta} k_{\alpha ; \beta} .0=kβkβ;α=kβkα;β.
The main laws of geometric optics:
The result is
(22.32) k k = 0 (propagation equation for wave vector). (22.32) k k = 0  (propagation equation for wave vector).  {:(22.32)grad_(k)k=0" (propagation equation for wave vector). ":}\begin{equation*} \boldsymbol{\nabla}_{\boldsymbol{k}} \boldsymbol{k}=0 \text { (propagation equation for wave vector). } \tag{22.32} \end{equation*}(22.32)kk=0 (propagation equation for wave vector). 
Notice that this is the geodesic equation! Combined with equation (22.30), it is the statement, derived from Maxwell's equations in curved spacetime, that light rays are null geodesics, the first main result of geometric optics.
Turn now from the propagation vector k = θ k = θ k=grad theta\boldsymbol{k}=\boldsymbol{\nabla} \thetak=θ to the wave amplitude a = a f a = a f a=af\boldsymbol{a}=a \boldsymbol{f}a=af, and obtain separate equations for the magnitude a a aaa and polarization f f f\boldsymbol{f}f. Use equation (22.31) to compute
2 a k a = 2 a k a = k a 2 = k ( a a ) = a k a + a k a = 1 2 ( k ) ( a a + a a ) = a 2 k ; 2 a k a = 2 a k a = k a 2 = k ( a a ¯ ) = a ¯ k a + a k a ¯ = 1 2 ( k ) ( a ¯ a + a a ¯ ) = a 2 k ; {:[2adel_(k)a=2agrad_(k)a=grad_(k)a^(2)=grad_(k)(a* bar(a))= bar(a)*grad_(k)a+a*grad_(k) bar(a)],[=-(1)/(2)(grad*k)( bar(a)*a+a* bar(a))=-a^(2)grad*k;]:}\begin{aligned} 2 a \partial_{\boldsymbol{k}} a & =2 a \boldsymbol{\nabla}_{\boldsymbol{k}} a=\boldsymbol{\nabla}_{\boldsymbol{k}} a^{2}=\boldsymbol{\nabla}_{\boldsymbol{k}}(\boldsymbol{a} \cdot \overline{\boldsymbol{a}})=\overline{\boldsymbol{a}} \cdot \boldsymbol{\nabla}_{\boldsymbol{k}} \boldsymbol{a}+\boldsymbol{a} \cdot \boldsymbol{\nabla}_{\boldsymbol{k}} \overline{\boldsymbol{a}} \\ & =-\frac{1}{2}(\boldsymbol{\nabla} \cdot \boldsymbol{k})(\overline{\boldsymbol{a}} \cdot \boldsymbol{a}+\boldsymbol{a} \cdot \overline{\boldsymbol{a}})=-a^{2} \boldsymbol{\nabla} \cdot \boldsymbol{k} ; \end{aligned}2aka=2aka=ka2=k(aa)=aka+aka=12(k)(aa+aa)=a2k;
so
(22.33) k a = 1 2 ( k ) a ( propagation equation for scalar amplitude ) . (22.33) k a = 1 2 ( k ) a (  propagation equation for scalar amplitude  ) . {:(22.33)del_(k)a=-(1)/(2)(grad*k)a(" propagation equation for scalar amplitude ").:}\begin{equation*} \partial_{\boldsymbol{k}} a=-\frac{1}{2}(\boldsymbol{\nabla} \cdot \boldsymbol{k}) a(\text { propagation equation for scalar amplitude }) . \tag{22.33} \end{equation*}(22.33)ka=12(k)a( propagation equation for scalar amplitude ).
Next write a = a f a = a f a=af\boldsymbol{a}=a \boldsymbol{f}a=af in equation (22.31) to obtain
0 = k ( a f ) + 1 2 ( k ) a f = a k f + f [ k a + 1 2 ( k ) a ] = a k f 0 = k ( a f ) + 1 2 ( k ) a f = a k f + f k a + 1 2 ( k ) a = a k f 0=grad_(k)(af)+(1)/(2)(grad*k)af=agrad_(k)f+f[grad_(k)a+(1)/(2)(grad*k)a]=agrad_(k)f0=\boldsymbol{\nabla}_{\boldsymbol{k}}(a \boldsymbol{f})+\frac{1}{2}(\boldsymbol{\nabla} \cdot \boldsymbol{k}) a \boldsymbol{f}=a \boldsymbol{\nabla}_{\boldsymbol{k}} \boldsymbol{f}+\boldsymbol{f}\left[\boldsymbol{\nabla}_{\boldsymbol{k}} a+\frac{1}{2}(\boldsymbol{\nabla} \cdot \boldsymbol{k}) a\right]=a \boldsymbol{\nabla}_{\boldsymbol{k}} \boldsymbol{f}0=k(af)+12(k)af=akf+f[ka+12(k)a]=akf
or
(22.34) k f = 0 (propagation equation for polarization vector). (22.34) k f = 0  (propagation equation for polarization vector).  {:(22.34)grad_(k)f=0" (propagation equation for polarization vector). ":}\begin{equation*} \boldsymbol{\nabla}_{\boldsymbol{k}} \boldsymbol{f}=0 \text { (propagation equation for polarization vector). } \tag{22.34} \end{equation*}(22.34)kf=0 (propagation equation for polarization vector). 
This together with equation (22.28'), constitutes the second main result of geometric optics, that the polarization vector is perpendicular to the rays and is parallel-propagated along the rays. It is now possible to see that these results, derived from equations (22.30) and (22.31) are consistent with the gauge condition (22.28). The vectors k k k\boldsymbol{k}k and f f f\boldsymbol{f}f, specified at one point, are fixed along the entire ray by their propagation equations. But because both propagation equations are parallel-transport laws, the conditions k k = 0 , f f = 1 k k = 0 , f f ¯ = 1 k*k=0,f* bar(f)=1\boldsymbol{k} \cdot \boldsymbol{k}=0, \boldsymbol{f} \cdot \overline{\boldsymbol{f}}=1kk=0,ff=1, and k f = 0 k f = 0 k*f=0\boldsymbol{k} \cdot \boldsymbol{f}=0kf=0, once imposed on the vectors at one point, will be satisfied along the entire ray.
The equation (22.33) for the scalar amplitude can be reformulated as a conservation law. Since k ( k ) k ( k ) del_(k)-=(k*grad)\partial_{\boldsymbol{k}} \equiv(\boldsymbol{k} \cdot \boldsymbol{\nabla})k(k), one rewrites the equation as ( k ) a 2 + a 2 k = 0 ( k ) a 2 + a 2 k = 0 (k*grad)a^(2)+a^(2)grad*k=0(\boldsymbol{k} \cdot \boldsymbol{\nabla}) a^{2}+a^{2} \boldsymbol{\nabla} \cdot \boldsymbol{k}=0(k)a2+a2k=0, or
(22.35) ( a 2 k ) = 0 (22.35) a 2 k = 0 {:(22.35)grad*(a^(2)k)=0:}\begin{equation*} \boldsymbol{\nabla} \cdot\left(a^{2} \boldsymbol{k}\right)=0 \tag{22.35} \end{equation*}(22.35)(a2k)=0
Consequently the vector a 2 k a 2 k a^(2)ka^{2} \boldsymbol{k}a2k is a "conserved current," and the integral a 2 k μ d 3 Σ μ a 2 k μ d 3 Σ μ inta^(2)k^(mu)d^(3)Sigma_(mu)\int a^{2} k^{\mu} d^{3} \Sigma_{\mu}a2kμd3Σμ has a fixed, unchanging value for each 3 -volume cutting a given tube formed of light rays. (The tube must be so formed of rays that an integral of a 2 k a 2 k a^(2)ka^{2} \boldsymbol{k}a2k over the walls of the tube will give zero.) What is conserved? To remain purely classical, one could say it is the "number of light rays" and call a 2 k 0 a 2 k 0 a^(2)k^(0)a^{2} k^{0}a2k0 the "density of light rays" on an x 0 = x 0 = x^(0)=x^{0}=x0= constant hypersurface. But the proper correspondence and more concrete physical interpretation make one prefer to call equation (22.35) the law of conservation of photon number. It is the third main result of geometric optics. Photon number, of course, is not always conserved; it is an adiabatic invariant, a quantity that is not changed by influences (e.g., spacetime curvature, 1 / R 2 1 / R 2 ∼1//R^(2)\sim 1 / \mathscr{R}^{2}1/R2 ) which change slowly ( λ ) ( λ ) (ℜ≫lambda)(\Re \gg \lambda)(λ) compared to the photon frequency.
Box 22.4 summarizes the above equations of geometric optics, along with others derived in the exercises.
(2) polarization vector is perpendicular to ray and is parallel propagated along ray
(3) conservation of "photon number"

Exercise 22.11. ELECTROMAGNETIC FIELD AND STRESS ENERGY

Derive the equations given in part D of Box 22.4 for F , E , B F , E , B F,E,B\boldsymbol{F}, \boldsymbol{E}, \boldsymbol{B}F,E,B, and T T T\boldsymbol{T}T.

Exercise 22.12. POLARIZATION

At an event P 0 P 0 P_(0)\mathscr{P}_{0}P0 through which geometric-optics waves are passing, introduce a local Lorentz frame with z z zzz-axis along the direction of propagation. Then k = ω ( e 0 + e z ) k = ω e 0 + e z k=omega(e_(0)+e_(z))\boldsymbol{k}=\omega\left(\boldsymbol{e}_{0}+\boldsymbol{e}_{z}\right)k=ω(e0+ez). Since the polarization vector is orthogonal to k k k\boldsymbol{k}k, it is f = f 0 ( e 0 + e z ) + f 1 e x + f 2 e y f = f 0 e 0 + e z + f 1 e x + f 2 e y f=f^(0)(e_(0)+e_(z))+f^(1)e_(x)+f^(2)e_(y)\boldsymbol{f}=f^{0}\left(\boldsymbol{e}_{0}+\boldsymbol{e}_{z}\right)+f^{1} \boldsymbol{e}_{x}+f^{2} \boldsymbol{e}_{y}f=f0(e0+ez)+f1ex+f2ey; and since f f = 1 f f ¯ = 1 f* bar(f)=1\boldsymbol{f} \cdot \overline{\boldsymbol{f}}=1ff=1, it has | f 1 | 2 + | f 2 | 2 = 1 f 1 2 + f 2 2 = 1 |f^(1)|^(2)+|f^(2)|^(2)=1\left|f^{1}\right|^{2}+\left|f^{2}\right|^{2}=1|f1|2+|f2|2=1.
(a) Show that the component f 0 f 0 f^(0)f^{0}f0 of the polarization vector has no influence on the electric and magnetic fields measured in the given frame; i.e., show that one can add a multiple of k k k\boldsymbol{k}k to f f f\boldsymbol{f}f without affecting any physical measurements.
(continued on page 581)

EXERCISES

Box 22.4 GEOMETRIC OPTICS IN CURVED SPACETIME (Summary of Results Derived in Text and Exercises)

A. Geometric Optics Assumption

Electromagnetic waves propagating in a source-free region of spacetime are locally plane-fronted and monochromatic (reduced wavelength λ λ lambda≪\lambda \llλ scale E E E\mathcal{E}E over which amplitude, wavelength, or polarization vary; and λ R = λ R = lambda≪R=\lambda \ll \mathscr{R}=λR= mean radius of curvature of spacetime).

B. Rays, Phase, and Wave Vector (see Box 22.3)

Everything (amplitude, polarization, energy, etc.) is transported along rays; and the quantities on one ray do not influence the quantities on any other ray.
The rays are null geodesics of curved spacetime, with tangent vectors ("wave vectors") k k k\boldsymbol{k}k :
k k = 0 k k = 0 grad_(k)k=0\boldsymbol{\nabla}_{\boldsymbol{k}} \boldsymbol{k}=0kk=0
The rays both lie in and are perpendicular to surfaces of constant phase, θ = θ = theta=\theta=θ= const.; and their tangent vectors are the gradient of θ θ theta\thetaθ :
k = θ . k = θ . k=grad theta.\boldsymbol{k}=\boldsymbol{\nabla} \theta .k=θ.
In a local Lorentz frame, k 0 k 0 k^(0)k^{0}k0 is the "angular frequency" and k 0 / 2 π k 0 / 2 π k^(0)//2pik^{0} / 2 \pik0/2π is the ordinary frequency of the waves, and
n = k / k 0 n = k / k 0 n=k//k^(0)n=k / k^{0}n=k/k0
is a unit 3 -vector pointing along their direction of propagation.

C. Amplitude and Polarization Vector

The waves are characterized by a real amplitude a a aaa and a complex polarization vector f f f\boldsymbol{f}f of unit length, f f = 1 f f ¯ = 1 f* bar(f)=1\boldsymbol{f} \cdot \overline{\boldsymbol{f}}=1ff=1. (Of the fundamental quantities θ , k , a , f θ , k , a , f theta,k,a,f\theta, \boldsymbol{k}, a, \boldsymbol{f}θ,k,a,f, all are real except f f f\boldsymbol{f}f. See exercise 22.12 for deeper understanding of f f f\boldsymbol{f}f.)
The polarization vector is everywhere orthogonal to the rays, k f = 0 k f = 0 k*f=0\boldsymbol{k} \cdot \boldsymbol{f}=0kf=0; and is parallel-transported along them, k f = 0 k f = 0 grad_(k)f=0\boldsymbol{\nabla}_{\boldsymbol{k}} \boldsymbol{f}=0kf=0.
The propagation law for the amplitude is
k a = 1 2 ( K ) a . k a = 1 2 ( K ) a . del_(k)a=-(1)/(2)(grad*K)a.\partial_{\boldsymbol{k}} a=-\frac{1}{2}(\boldsymbol{\nabla} \cdot \boldsymbol{K}) a .ka=12(K)a.
This propagation law is equivalent to a law of conservation of photons (classically: of rays); a 2 k a 2 k a^(2)ka^{2} \boldsymbol{k}a2k is the "conserved current" satisfying ( a 2 k ) = 0 a 2 k = 0 grad*(a^(2)k)=0\boldsymbol{\nabla} \cdot\left(a^{2} \boldsymbol{k}\right)=0(a2k)=0; and ( 8 π ) 1 a 2 k 0 | g | d 3 x ( 8 π ) 1 a 2 k 0 | g | d 3 x (8piℏ)^(-1)inta^(2)k^(0)sqrt(|g|)d^(3)x(8 \pi \hbar)^{-1} \int a^{2} k^{0} \sqrt{|g|} d^{3} x(8π)1a2k0|g|d3x is the number of photons (rays) in the 3 -volume of integration on any x 0 = x 0 = x^(0)=x^{0}=x0= constant hypersurface, and is constant as this volume is carried along the rays.
The propagation law holds separately on each hypersurface of constant phase. There it can be interpreted as conservation of a a 2 a a 2 a a^(2)aa^{2} aa2a, where a a aaa is a two-dimensional cross-sectional area of a pulse of photons or rays. See exercise 22.13.

D. Vector Potential, Electromagnetic Field, and Stress-Energy-Momentum

At any event the vector potential in Lorentz gauge is
A = x { a e i θ f } , A = x a e i θ f , A=x{ae^(i theta)f},\boldsymbol{A}=\mathbf{x}\left\{a e^{i \theta} \boldsymbol{f}\right\},A=x{aeiθf},
where k i k i k_(i)\mathrm{k}_{\mathrm{i}}ki denotes the real part.
The electromagnetic field tensor is orthogonal to the rays, F k = 0 F k = 0 F*k=0\boldsymbol{F} \cdot \boldsymbol{k}=0Fk=0, and is given by
F = x { i a e i θ k f } . F = x i a e i θ k f . F=x{iae^(i theta)k^^f}.\boldsymbol{F}=\mathbf{x}\left\{i a e^{i \theta} \boldsymbol{k} \wedge \boldsymbol{f}\right\} .F=x{iaeiθkf}.
The corresponding electric and magnetic fields in any local Lorentz frame are
E = i { i a k 0 e i θ ( projection of f perpendicular to k ) } , B = n × E , where n k / k 0 . E = i i a k 0 e i θ (  projection of  f  perpendicular to  k ) , B = n × E ,  where  n k / k 0 . {:[E=i{iak^(0)e^(i theta)(" projection of "f" perpendicular to "k)}","],[B=n xx E","" where "n-=k//k^(0).]:}\begin{gathered} \boldsymbol{E}=\boldsymbol{\mathfrak { i }}\left\{i a k^{0} e^{i \theta}(\text { projection of } \boldsymbol{f} \text { perpendicular to } \boldsymbol{k})\right\}, \\ \boldsymbol{B}=\boldsymbol{n} \times \boldsymbol{E}, \text { where } \boldsymbol{n} \equiv \boldsymbol{k} / k^{0} . \end{gathered}E=i{iak0eiθ( projection of f perpendicular to k)},B=n×E, where nk/k0.
The stress-energy tensor, averaged over a wavelength, is
T = ( 1 / 8 π ) a 2 k k T = ( 1 / 8 π ) a 2 k k T=(1//8pi)a^(2)k ox k\boldsymbol{T}=(1 / 8 \pi) a^{2} \boldsymbol{k} \otimes \boldsymbol{k}T=(1/8π)a2kk
corresponding to an energy density in a local Lorentz frame of
T 00 = ( 1 / 8 π ) ( a k 0 ) 2 T 00 = ( 1 / 8 π ) a k 0 2 T^(00)=(1//8pi)(ak^(0))^(2)T^{00}=(1 / 8 \pi)\left(a k^{0}\right)^{2}T00=(1/8π)(ak0)2
and an energy flux of
T 0 j = T 00 n j T 0 j = T 00 n j T^(0j)=T^(00)n^(j)T^{0 j}=T^{00} n^{j}T0j=T00nj
so that energy flows along the rays (in n = k / k 0 n = k / k 0 n=k//k^(0)\boldsymbol{n}=\boldsymbol{k} / k^{0}n=k/k0 direction) with the speed of light. This is identical with the stress-energy tensor that would be produced by a beam of photons with 4-momenta p = k p = k p=ℏk\boldsymbol{p}=\hbar \boldsymbol{k}p=k.
Conservation of energy-momentum T = 0 T = 0 grad*T=0\boldsymbol{\nabla} \cdot \boldsymbol{T}=0T=0 follows from the ray conservation law ( a 2 k ) = 0 a 2 k = 0 grad*(a^(2)k)=0\boldsymbol{\nabla} \cdot\left(a^{2} \boldsymbol{k}\right)=0(a2k)=0 and the geodesic law k k ( k ) k = 0 k k ( k ) k = 0 grad_(k)k-=(k*grad)k=0\boldsymbol{\nabla}_{\boldsymbol{k}} \boldsymbol{k} \equiv(\boldsymbol{k} \cdot \boldsymbol{\nabla}) \boldsymbol{k}=0kk(k)k=0 :
8 π T = ( a 2 k k ) = [ ( a 2 k ) ] k + a 2 ( k ) k = 0 . 8 π T = a 2 k k = a 2 k k + a 2 ( k ) k = 0 . 8pi grad*T=grad*(a^(2)k ox k)=[grad*(a^(2)k)]k+a^(2)(k*grad)k=0.8 \pi \boldsymbol{\nabla} \cdot \boldsymbol{T}=\boldsymbol{\nabla} \cdot\left(a^{2} \boldsymbol{k} \otimes \boldsymbol{k}\right)=\left[\boldsymbol{\nabla} \cdot\left(a^{2} \boldsymbol{k}\right)\right] \boldsymbol{k}+a^{2}(\boldsymbol{k} \cdot \boldsymbol{\nabla}) \boldsymbol{k}=0 .8πT=(a2kk)=[(a2k)]k+a2(k)k=0.

Box 22.4 (continued)

The adiabatic (geometric optics) invariant "ray number" a 2 k 0 a 2 k 0 a^(2)k^(0)a^{2} k^{0}a2k0 or "photon number" ( 8 π ) 1 a 2 k 0 ( 8 π ) 1 a 2 k 0 (8piℏ)^(-1)a^(2)k^(0)(8 \pi \hbar)^{-1} a^{2} k^{0}(8π)1a2k0 in a unit volume is proportional to the energy, ( 8 π ) 1 a 2 ( k 0 ) 2 ( 8 π ) 1 a 2 k 0 2 (8pi)^(-1)a^(2)(k^(0))^(2)(8 \pi)^{-1} a^{2}\left(k^{0}\right)^{2}(8π)1a2(k0)2, divided by the frequency, k 0 k 0 k^(0)k^{0}k0-corresponding exactly to the harmonic oscillator adiabatic invariant E / ω E / ω E//omegaE / \omegaE/ω [Einstein (1912), Ehrenfest (1916), Landau and Lifshitz (1960)].

E. Photon Reinterpretation of Geometric Optics

The laws of geometric optics can be reinterpreted as follows. This reinterpretation becomes a foundation of the standard quantum theory of the electromagnetic field (see, e.g., Chapters 1 and 13 of Baym (1969)]; and the classical limit of that quantum theory is standard Maxwell electrodynamics.
Photons are particles of zero rest mass that move along null geodesics of spacetime (the null rays).
The 4-momentum of a photon is related to the tangent vector of the null ray (wave vector) by p = k p = k p=ℏk\boldsymbol{p}=\hbar \boldsymbol{k}p=k. A renormalization of the affine parameter,
( new parameter ) = ( 1 / ) × ( old parameter ) , (  new parameter  ) = ( 1 / ) × (  old parameter  ) , (" new parameter ")=(1//ℏ)xx(" old parameter "),(\text { new parameter })=(1 / \hbar) \times(\text { old parameter }),( new parameter )=(1/)×( old parameter ),
makes p p p\boldsymbol{p}p the tangent vector to the ray.
Each photon possesses a polarization vector, f f f\boldsymbol{f}f, which is orthogonal to its 4 -momentum ( p f = 0 ) ( p f = 0 ) (p*f=0)(\boldsymbol{p} \cdot \boldsymbol{f}=0)(pf=0), and which it parallel-transports along its geodesic world line ( p f = 0 ) p f = 0 (grad_(p)f=0)\left(\nabla_{p} f=0\right)(pf=0).
A swarm of photons, all with nearly the same 4-momentum p p p\boldsymbol{p}p and polarization vector f f f\boldsymbol{f}f (as compared by parallel transport), make up a classical electromagnetic wave. The scalar amplitude a a aaa of the wave is determined by equating the stress-energy tensor of the wave
T = 1 8 π a 2 k k = 1 8 π ( a ) 2 p p T = 1 8 π a 2 k k = 1 8 π a 2 p p T=(1)/(8pi)a^(2)k ox k=(1)/(8pi)((a)/(ℏ))^(2)p ox p\boldsymbol{T}=\frac{1}{8 \pi} a^{2} \boldsymbol{k} \otimes \boldsymbol{k}=\frac{1}{8 \pi}\left(\frac{a}{\hbar}\right)^{2} \boldsymbol{p} \otimes \boldsymbol{p}T=18πa2kk=18π(a)2pp
to the stress-energy tensor of a swarm of photons with number-flux vector S S S\boldsymbol{S}S,
T = p S T = p S T=p ox S\boldsymbol{T}=\boldsymbol{p} \otimes \boldsymbol{S}T=pS
[see equation (5.18)]. The result:
S = 1 8 π ( a ) 2 p = 1 8 π a 2 k S = 1 8 π a 2 p = 1 8 π a 2 k S=(1)/(8pi)((a)/(ℏ))^(2)p=(1)/(8piℏ)a^(2)k\boldsymbol{S}=\frac{1}{8 \pi}\left(\frac{a}{\hbar}\right)^{2} \boldsymbol{p}=\frac{1}{8 \pi \hbar} a^{2} \boldsymbol{k}S=18π(a)2p=18πa2k
or, in any local Lorentz frame,
a = ( 8 π 2 S 0 / p 0 ) 1 / 2 = ( 8 π ) 1 / 2 ( number density of photons energy of one photon ) 1 / 2 . a = 8 π 2 S 0 / p 0 1 / 2 = ( 8 π ) 1 / 2  number density of photons   energy of one photon  1 / 2 . a=(8piℏ^(2)S^(0)//p^(0))^(1//2)=(8pi)^(1//2)ℏ((" number density of photons ")/(" energy of one photon "))^(1//2).a=\left(8 \pi \hbar^{2} S^{0} / p^{0}\right)^{1 / 2}=(8 \pi)^{1 / 2} \hbar\left(\frac{\text { number density of photons }}{\text { energy of one photon }}\right)^{1 / 2} .a=(8π2S0/p0)1/2=(8π)1/2( number density of photons  energy of one photon )1/2.
(b) Show that the following polarization vectors correspond to the types of polarization listed:
f = e x , linear polarization in x direction; f = e y , linear polarization in y direction; f = 1 2 ( e x + i e y ) , righthand circular polarization; f = 1 2 ( e x i e y ) , lefthand circular polarization; f = α e x + i ( 1 α 2 ) 1 / 2 e y , righthand elliptical polarization. f = e x ,  linear polarization in  x  direction;  f = e y ,  linear polarization in  y  direction;  f = 1 2 e x + i e y ,  righthand circular polarization;  f = 1 2 e x i e y ,  lefthand circular polarization;  f = α e x + i 1 α 2 1 / 2 e y ,  righthand elliptical polarization.  {:[f=e_(x)","" linear polarization in "x" direction; "],[f=e_(y)","" linear polarization in "y" direction; "],[f=(1)/(sqrt2)(e_(x)+ie_(y))","" righthand circular polarization; "],[f=(1)/(sqrt2)(e_(x)-ie_(y))","" lefthand circular polarization; "],[f=alphae_(x)+i(1-alpha^(2))^(1//2)e_(y)","" righthand elliptical polarization. "]:}\begin{aligned} & \boldsymbol{f}=\boldsymbol{e}_{x}, \text { linear polarization in } x \text { direction; } \\ & \boldsymbol{f}=\boldsymbol{e}_{y}, \text { linear polarization in } y \text { direction; } \\ & \boldsymbol{f}=\frac{1}{\sqrt{2}}\left(\boldsymbol{e}_{x}+i \boldsymbol{e}_{y}\right), \text { righthand circular polarization; } \\ & \boldsymbol{f}=\frac{1}{\sqrt{2}}\left(\boldsymbol{e}_{x}-i \boldsymbol{e}_{y}\right), \text { lefthand circular polarization; } \\ & \boldsymbol{f}=\alpha \boldsymbol{e}_{x}+i\left(1-\alpha^{2}\right)^{1 / 2} \boldsymbol{e}_{y}, \text { righthand elliptical polarization. } \end{aligned}f=ex, linear polarization in x direction; f=ey, linear polarization in y direction; f=12(ex+iey), righthand circular polarization; f=12(exiey), lefthand circular polarization; f=αex+i(1α2)1/2ey, righthand elliptical polarization. 
(c) Show that the type of polarization (linear; circular; elliptical with given eccentricity of ellipse) is the same as viewed in any local Lorentz frame at any event along a given ray. [Hint: Use pictures and abstract calculations rather than Lorentz transformations and component calculations.]

Exercise 22.13. THE AREA OF A bundle of rays

Write equation (22.31) in a coordinate system in which one of the coordinates is chosen to be x 0 = θ x 0 = θ x^(0)=thetax^{0}=\thetax0=θ, the phase (a retarded time coordinate).
(a) Show that g 00 = 0 g 00 = 0 g^(00)=0g^{00}=0g00=0 and that no derivatives / θ / θ del//del theta\partial / \partial \theta/θ appear in equation (22.33); so propagation of a a aaa can be described within a single θ = θ = theta=\theta=θ= constant hypersurface.
(b) Perform the following construction (see Figure 22.1). Pick a ray C 0 C 0 C_(0)\mathcal{C}_{0}C0 along which a a aaa is to be propagated. Pick a bundle of rays, with two-dimensional cross section, that (i) all lie in the same constant-phase surface as E 0 E 0 E_(0)\mathcal{E}_{0}E0, and (ii) surround E 0 E 0 E_(0)\mathcal{E}_{0}E0. (The surface is three-di-
Figure 22.1.
Geometric optics for a bundle of rays with two-dimensional cross section, all lying in a surface of constant phase, θ = θ = theta=\theta=θ= const. Sketch (a) shows the bundle, surrounding a central ray e 0 e 0 e_(0)\mathfrak{e}_{0}e0, in a spacetime diagram with one spatial dimension suppressed. Sketch (b) shows the bundle as viewed on a slice of simultaneity in a local Lorentz frame at the event Φ 0 Φ 0 Phi_(0)\mathscr{\Phi}_{0}Φ0. Slicing the bundle turns each ray into a "photon"; so the bundle becomes a two-dimensional surface filled with photons. The area Ω Ω Omega\OmegaΩ of this photon-filled surface obeys the following laws (see exercises 22.13 and 22.14): (1) C C C\mathscr{C}C is independent of the choice of Lorentz frame; it depends only on location P 0 P 0 P_(0)\mathscr{\mathscr { P }}_{0}P0 along the ray C 0 C 0 C_(0)\mathcal{C}_{0}C0. (2) The amplitude a a aaa of the waves satisfies
a a 2 = constant all along the ray e 0 a a 2 =  constant all along the ray  e 0 aa^(2)=" constant all along the ray "e_(0)a a^{2}=\text { constant all along the ray } \mathcal{e}_{0}aa2= constant all along the ray e0
("conservation of photon flux"). (3) a obeys the "propagation equation" (22.36).
mensional, so any bundle filling it has a two-dimensional cross section.) At any event P 0 P 0 P_(0)\mathscr{P}_{0}P0, in any local Lorentz frame there, on a "slice of simultaneity" x 0 = x 0 = x^(0)=x^{0}=x0= constant, measure the cross-sectional area a a aaa of the bundle. (Note: the area being measured is perpendicular to k k k\boldsymbol{k}k in the three-dimensional Euclidean sense; it can be thought of as the region occupied momentarily by a group of photons propagating along, side by side, in the k k k\boldsymbol{k}k direction.) Show that the area C C C\mathbb{C}C is the same, at a given event P 0 P 0 P_(0)\mathscr{P}_{0}P0, regardless of what Lorentz frame is used to measure it; but the area changes from point to point along the ray C 0 C 0 C_(0)\mathcal{C}_{0}C0 as a result of the rays' divergence away from each other or convergence toward each other:
(22.36) k a = ( k ) a . (22.36) k a = ( k ) a . {:(22.36)del_(k)a=(grad*k)a.:}\begin{equation*} \partial_{\boldsymbol{k}} a=(\boldsymbol{\nabla} \cdot \boldsymbol{k}) a . \tag{22.36} \end{equation*}(22.36)ka=(k)a.
Then show that C a a 2 C a a 2 Caa^(2)\mathscr{C a} a^{2}Caa2 is a constant everywhere along the ray C 0 C 0 C_(0)\mathcal{C}_{0}C0 ("conservation of photon flux"). [Hints: (i) Any vector ξ ξ xi\xiξ connecting adjacent rays in the bundle is perpendicular to k k k\boldsymbol{k}k, because ξ ξ xi\boldsymbol{\xi}ξ lies in a surface of constant θ θ theta\thetaθ and k ξ = κ ~ , ξ = d θ , ξ = k ξ = κ ~ , ξ = d θ , ξ = k*xi=(: widetilde(kappa),xi:)=(:d theta,xi:)=\boldsymbol{k} \cdot \boldsymbol{\xi}=\langle\widetilde{\boldsymbol{\kappa}}, \boldsymbol{\xi}\rangle=\langle\boldsymbol{d} \theta, \boldsymbol{\xi}\rangle=kξ=κ~,ξ=dθ,ξ= (change in θ θ theta\thetaθ along ξ ξ xi\xiξ ) = 0 = 0 =0=0=0. (ii) Consider, for simplicity, a bundle with rectangular cross section as seen in a specific local Lorentz frame at a specific event P 0 P 0 P_(0)\mathscr{P}_{0}P0 [edge vectors v v v\boldsymbol{v}v and w w w\boldsymbol{w}w with v w = 0 v w = 0 v*w=0\boldsymbol{v} \cdot \boldsymbol{w}=0vw=0 (edges perpendicular) and v e 0 = w e 0 = 0 v e 0 = w e 0 = 0 v*e_(0)=w*e_(0)=0\boldsymbol{v} \cdot \boldsymbol{e}_{0}=\boldsymbol{w} \cdot \boldsymbol{e}_{0}=0ve0=we0=0 (edges in surface of constant time) and v k = w k = 0 v k = w k = 0 v*k=w*k=0\boldsymbol{v} \cdot \boldsymbol{k}=\boldsymbol{w} \cdot \boldsymbol{k}=0vk=wk=0 (since edge vectors connect adjacent rays of the bundle)]. Show pictorially that in any other Lorentz frame at P 0 P 0 P_(0)\mathscr{P}_{0}P0, the edge vectors are v = v + α k v = v + α k v^(')=v+alpha k\boldsymbol{v}^{\prime}=\boldsymbol{v}+\alpha \boldsymbol{k}v=v+αk and w = w + β k w = w + β k w^(')=w+beta k\boldsymbol{w}^{\prime}=\boldsymbol{w}+\beta \boldsymbol{k}w=w+βk for some α α alpha\alphaα and β β beta\betaβ. Conclude that in all Lorentz frames at P 0 P 0 P_(0)\mathscr{P}_{0}P0 the cross section has identical shape and identical area, and is spatially perpendicular to the direction of propagation ( k v = k w = 0 k v = k w = 0 (k*v=k*w=0:}\left(\boldsymbol{k} \cdot \boldsymbol{v}=\boldsymbol{k} \cdot \boldsymbol{w}=0\right.(kv=kw=0 ). (iii) By a calculation in a local Lorentz frame show that k d = ( k ) d k d = ( k ) d del_(k)d=(grad*k)d\partial_{\boldsymbol{k}} \mathscr{d}=(\boldsymbol{\nabla} \cdot \boldsymbol{k}) \mathfrak{d}kd=(k)d. (iv) Conclude from k a = 1 2 ( k ) a k a = 1 2 ( k ) a del_(k)a=-(1)/(2)(grad*k)a\partial_{\boldsymbol{k}} a=-\frac{1}{2}(\boldsymbol{\nabla} \cdot \boldsymbol{k}) aka=12(k)a that k ( a a 2 ) = 0 k a a 2 = 0 del_(k)(aa^(2))=0\partial_{\boldsymbol{k}}\left(a a^{2}\right)=0k(aa2)=0.]

Exercise 22.14. FOCUSING THEOREM

The cross-sectional area a a aaa of a bundle of rays all lying in the same surface of constant phase changes along the central ray of the bundle at the rate (22.36) (see Figure 22.1).
(a) Derive the following equation ("focusing equation") for the second derivative of a 1 / 2 a 1 / 2 a^(1//2)a^{1 / 2}a1/2 :
(22.37) d 2 Q 1 / 2 d λ 2 = ( | σ | 2 + 1 2 R α β k α k β ) a 1 / 2 (22.37) d 2 Q 1 / 2 d λ 2 = | σ | 2 + 1 2 R α β k α k β a 1 / 2 {:(22.37)(d^(2)Q^(1//2))/(dlambda^(2))=-(|sigma|^(2)+(1)/(2)R_(alpha beta)k^(alpha)k^(beta))a^(1//2):}\begin{equation*} \frac{d^{2} Q^{1 / 2}}{d \lambda^{2}}=-\left(|\sigma|^{2}+\frac{1}{2} R_{\alpha \beta} k^{\alpha} k^{\beta}\right) a^{1 / 2} \tag{22.37} \end{equation*}(22.37)d2Q1/2dλ2=(|σ|2+12Rαβkαkβ)a1/2
where λ λ lambda\lambdaλ is affine parameter along the central ray ( k = d / d λ ) ( k = d / d λ ) (k=d//d lambda)(\boldsymbol{k}=d / d \lambda)(k=d/dλ), and the "magnitude of the shear of the rays", | σ | | σ | |sigma||\sigma||σ|, is defined by the equation
(22.38) | σ | 2 1 2 k α ; β k α ; β 1 4 ( k ; μ μ ) 2 (22.38) | σ | 2 1 2 k α ; β k α ; β 1 4 k ; μ μ 2 {:(22.38)|sigma|^(2)-=(1)/(2)k_(alpha;beta)k^(alpha;beta)-(1)/(4)(k_(;mu)^(mu))^(2):}\begin{equation*} |\sigma|^{2} \equiv \frac{1}{2} k_{\alpha ; \beta} k^{\alpha ; \beta}-\frac{1}{4}\left(k_{; \mu}^{\mu}\right)^{2} \tag{22.38} \end{equation*}(22.38)|σ|212kα;βkα;β14(k;μμ)2
[Hint: This is a vigorous exercise in index manipulations. The key equations needed in the manipulations are a , α k α = ( k α ; α ) Q a , α k α = k α ; α Q a_(,alpha)k^(alpha)=(k^(alpha)_(;alpha))Q\mathscr{a}_{, \alpha} k^{\alpha}=\left(k^{\alpha}{ }_{; \alpha}\right) \mathscr{Q}a,αkα=(kα;α)Q [equation (22.36)]; k α ; β k β = 0 k α ; β k β = 0 k^(alpha)_(;beta)k^(beta)=0k^{\alpha}{ }_{; \beta} k^{\beta}=0kα;βkβ=0 [geodesic equation (22.32) for rays]; k α ; β = k β ; α k α ; β = k β ; α k_(alpha;beta)=k_(beta;alpha)k_{\alpha ; \beta}=k_{\beta ; \alpha}kα;β=kβ;α [which follows from k α θ , α ] k α θ , α {:k_(alpha)-=theta_(,alpha)]\left.k_{\alpha} \equiv \theta_{, \alpha}\right]kαθ,α]; and the rule (16.6c) for interchanging covariant derivatives of a vector.]
(b) Show that, in a local Lorentz frame where k = ω ( e t + e z ) k = ω e t + e z k=omega(e_(t)+e_(z))\boldsymbol{k}=\omega\left(\boldsymbol{e}_{t}+\boldsymbol{e}_{z}\right)k=ω(et+ez) at the origin,
(22.39) | σ | 2 = 1 4 ( k x , x k y , y ) 2 + ( k x , y ) 2 . (22.39) | σ | 2 = 1 4 k x , x k y , y 2 + k x , y 2 . {:(22.39)|sigma|^(2)=(1)/(4)(k_(x,x)-k_(y,y))^(2)+(k_(x,y))^(2).:}\begin{equation*} |\sigma|^{2}=\frac{1}{4}\left(k_{x, x}-k_{y, y}\right)^{2}+\left(k_{x, y}\right)^{2} . \tag{22.39} \end{equation*}(22.39)|σ|2=14(kx,xky,y)2+(kx,y)2.
Thus, | σ | 2 | σ | 2 |sigma|^(2)|\sigma|^{2}|σ|2 is nonnegative, which justifies the use of the absolute value sign.
(c) Discussion: The quantity | σ | | σ | |sigma||\sigma||σ| is called the shear of the bundle of rays because it measures the extent to which neighboring rays are sliding past each other [see, e.g., Sachs (1964)]. Hence, the focusing equation (22.37) says that shear focuses a bundle of rays (makes d 2 a 1 / 2 / d λ 2 < 0 d 2 a 1 / 2 / d λ 2 < 0 d^(2)a^(1//2)//dlambda^(2) < 0d^{2} a^{1 / 2} / d \lambda^{2}<0d2a1/2/dλ2<0 ); and spacetime curvature also focuses it if R α β k α k β > 0 R α β k α k β > 0 R_(alpha beta)k^(alpha)k^(beta) > 0R_{\alpha \beta} k^{\alpha} k^{\beta}>0Rαβkαkβ>0, but defocuses it if R α β k α k β < 0 R α β k α k β < 0 R_(alpha beta)k^(alpha)k^(beta) < 0R_{\alpha \beta} k^{\alpha} k^{\beta}<0Rαβkαkβ<0. (When a bundle of toothpicks, originally circular in cross section, is squeezed into an elliptic cross section, it is sheared.)
(d) Assume that the energy density T 0 ^ 0 ^ T 0 ^ 0 ^ T_( hat(0) hat(0))T_{\hat{0} \hat{0}}T0^0^, as measured by any observer anywhere in spacetime, is nonnegative. By combining the focusing equation (22.37) with the Einstein field equation, conclude that
(22.40) d 2 a 1 / 2 d λ 2 0 ( for any bundle of rays, all in the same surface of constant phase, anywhere in spacetime ) (22.40) d 2 a 1 / 2 d λ 2 0  for any bundle of rays, all in the same   surface of constant phase, anywhere in   spacetime  {:(22.40)(d^(2)a^(1//2))/(dlambda^(2)) <= 0([" for any bundle of rays, all in the same "],[" surface of constant phase, anywhere in "],[" spacetime "]):}\frac{d^{2} a^{1 / 2}}{d \lambda^{2}} \leq 0\left(\begin{array}{l} \text { for any bundle of rays, all in the same } \tag{22.40}\\ \text { surface of constant phase, anywhere in } \\ \text { spacetime } \end{array}\right)(22.40)d2a1/2dλ20( for any bundle of rays, all in the same  surface of constant phase, anywhere in  spacetime )
(focusing theorem). This theorem plays a crucial role in black-hole physics ( § 34.5 § 34.5 §34.5\S 34.5§34.5 ) and in the theory of singularities ( § 34.6 § 34.6 §34.6\S 34.6§34.6 ).

§22.6. KINETIC THEORY IN CURVED SPACETIME*

The stars in a galaxy wander through spacetime, each on its own geodesic world line, each helping to produce the spacetime curvature felt by all the others. Photons, left over from the hot phases of the big bang, bathe the Earth, bringing with themselves data on the homogeneity and isotropy of the universe. Theoretical analyses of these and many other problems are unmanageable, if they attempt to keep track of the motion of every single star or photon. But a statistical description gives accurate results and is powerful. Moreover, for most problems in astrophysics and cosmology, the simplest of statistical descriptions-one ignoring collisions-is adequate. Usually collisions are unimportant for the large-scale behavior of a system (e.g.; a galaxy), or they are so important that a fluid description is possible (e.g., in a stellar interior).
Consider, then, a swarm of particles (stars, or photons, or black holes, or . . .) that move through spacetime on geodesic world lines, without colliding. Assume, for simplicity, that the particles all have the same rest mass. Then all information of a statistical nature about the particles can be incorporated into a single function, the "distribution function" or "number density in phase space", O O O\mathscr{O}O.
Define R R R\mathscr{R}R in terms of measurements made by a specific local Lorentz observer at a specific event P 0 P 0 P_(0)\mathscr{P}_{0}P0 in curved spacetime. Give the observer a box with 3 -volume V x V x V_(x)\mathscr{V}_{x}Vx (and with imaginary walls). Ask the observer to count how many particles, N N NNN, are inside the box and have local-Lorentz momentum components p j p j p^(j)p^{j}pj in the range
P j 1 2 Δ p j < p j < P j + 1 2 Δ p j P j 1 2 Δ p j < p j < P j + 1 2 Δ p j P^(j)-(1)/(2)Deltap^(j) < p^(j) < P^(j)+(1)/(2)Deltap^(j)P^{j}-\frac{1}{2} \Delta p^{j}<p^{j}<P^{j}+\frac{1}{2} \Delta p^{j}Pj12Δpj<pj<Pj+12Δpj
(He can ignore the particle energies p 0 p 0 p^(0)p^{0}p0; since all particles have the same rest mass m m mmm, energy
p 0 = ( m 2 + p 2 ) 1 / 2 p 0 = m 2 + p 2 1 / 2 p^(0)=(m^(2)+p^(2))^(1//2)p^{0}=\left(m^{2}+p^{2}\right)^{1 / 2}p0=(m2+p2)1/2
Volume in phase space for a group of identical particles
Lorentz invariance of volume in phase space
Liouville's theorem (conservation of volume in phase space)
Number density in phase space (distribution function)
is fixed uniquely by momentum.) The volume in momentum space occupied by the N N NNN particles is V p = Δ p x Δ p y Δ p z V p = Δ p x Δ p y Δ p z V_(p)=Deltap^(x)Deltap^(y)Deltap^(z)\mathscr{V}_{p}=\Delta p^{x} \Delta p^{y} \Delta p^{z}Vp=ΔpxΔpyΔpz; and the volume in phase space is
(22.41) V V x V p (22.41) V V x V p {:(22.41)V-=V_(x)V_(p):}\begin{equation*} \mathscr{V} \equiv \mathscr{V}_{x} \mathscr{V}_{p} \tag{22.41} \end{equation*}(22.41)VVxVp
Other observers at P 0 P 0 P_(0)\mathscr{\mathscr { P }}_{0}P0, moving relative to the first, will disagree on how much spatial volume V x V x V_(x)\mathscr{V}_{x}Vx and how much momentum volume V p V p V_(p)\mathscr{V}_{p}Vp these same N N NNN particles occupy:
(22.42) V x and V p depend on the choice of Lorentz fram.e. (22.42) V x  and  V p  depend on the choice of Lorentz fram.e.  {:(22.42)V_(x)" and "V_(p)" depend on the choice of Lorentz fram.e. ":}\begin{equation*} \mathscr{V}_{x} \text { and } \mathscr{V}_{p} \text { depend on the choice of Lorentz fram.e. } \tag{22.42} \end{equation*}(22.42)Vx and Vp depend on the choice of Lorentz fram.e. 
However, all observers will agree on the value of the product V V x V p V V x V p V-=V_(x)V_(p)\mathscr{V} \equiv \mathscr{V}_{x} \mathscr{V}_{p}VVxVp ("volume in phase space"):
The phase-space volume V V V\mathscr{V}V occupied by a given set of N N NNN identical particles at a given event in spacetime is independent of the local Lorentz frame in which it is measured.
(See Box 22.5 for proof.) Moreover, as the same N N NNN particles move through spacetime along their geodesic world lines (and through momentum space), the volume V V V\mathscr{V}V they span in phase space remains constant:
The V V V\mathscr{V}V occupied by a given swarm of N N NNN particles is independent of location along the world line of the swarm ("Liouville's theorem in curved spacetime").
(See Box 22.6 for proof.)
More convenient for applications than the volume V V V\mathscr{V}V in phase space occupied by a given set of N N NNN particles is the "number density in phase space" ("distribution function") in the neighborhood of one of these particles:
(22.45) R N / V (22.45) R N / V {:(22.45)R-=N//V:}\begin{equation*} \mathscr{R} \equiv N / \mathscr{V} \tag{22.45} \end{equation*}(22.45)RN/V
On what does this number density depend? It depends on the location in spacetime, P P P\mathscr{P}P, at which the measurements are made. It also depends on the 4-momentum p p p\boldsymbol{p}p of the particle in whose neighborhood the measurements are made. But because the particles all have the same rest mass, p p p\boldsymbol{p}p cannot take on any and every value in the tangent space at P P P\mathscr{P}P. Rather, p p p\boldsymbol{p}p is confined to the "forward mass hyperboloid" at P P P\mathscr{P}P :
p 2 = m 2 ; p lies inside future light cone. p 2 = m 2 ; p  lies inside future light cone.  p^(2)=m^(2);quad p" lies inside future light cone. "\boldsymbol{p}^{2}=m^{2} ; \quad \boldsymbol{p} \text { lies inside future light cone. }p2=m2;p lies inside future light cone. 
Thus,
(22.46) R = R [ ( location, P , in spacetime ) , ( 4 -momentum p , which must lie on the forward mass hyperboloid of the tangent space at P ) ] . (22.46) R = R (  location,  P ,  in spacetime  ) , 4 -momentum  p ,  which must lie   on the forward mass hyperboloid   of the tangent space at  P . {:(22.46)R=R[((" location, "P,)/(" in spacetime ")),([4"-momentum "p","" which must lie "],[" on the forward mass hyperboloid "],[" of the tangent space at "P])].:}\mathscr{R}=\mathscr{R}\left[\binom{\text { location, } \mathscr{P},}{\text { in spacetime }},\left(\begin{array}{l} 4 \text {-momentum } \boldsymbol{p}, \text { which must lie } \tag{22.46}\\ \text { on the forward mass hyperboloid } \\ \text { of the tangent space at } \mathscr{P} \end{array}\right)\right] .(22.46)R=R[( location, P, in spacetime ),(4-momentum p, which must lie  on the forward mass hyperboloid  of the tangent space at P)].
Pick some one particle in the swarm, with geodesic world line P ( λ ) [ λ = P ( λ ) [ λ = P(lambda)[lambda=\mathscr{P}(\lambda)[\lambda=P(λ)[λ= (affine parameter ) = ( ) = ( )=()=()=( proper time, if particle has finite rest mass)], and with 4-momentum

Box 22.5 VOLUME IN PHASE SPACE

A. For Swarm of Identical Particles with Nonzero Rest Mass

Pick an event P 0 P 0 P_(0)\mathscr{P}_{0}P0, through which passes a particle named "John" with a 4 -momentum named " P P P\boldsymbol{P}P ". In John's local Lorentz rest frame at P 0 P 0 P_(0)\mathscr{\mathscr { P }}_{0}P0 ("barred frame", s ¯ s ¯ bar(s)\bar{s}s¯ ), select a small 3 -volume, V x Δ x ¯ Δ y ¯ V x Δ x ¯ Δ y ¯ V_(x)-=Delta bar(x)Delta bar(y)\mathscr{V}_{x} \equiv \Delta \bar{x} \Delta \bar{y}VxΔx¯Δy¯ Δ z ¯ Δ z ¯ Delta bar(z)\Delta \bar{z}Δz¯, containing him. Also select a small " 3 -volume in momentum space, V p ¯ Δ p x ¯ Δ p y ¯ Δ p z ¯ V p ¯ Δ p x ¯ Δ p y ¯ Δ p z ¯ ^(')V_( bar(p))-=Deltap^( bar(x))Deltap^( bar(y))Deltap^( bar(z)){ }^{\prime} \mathscr{V}_{\bar{p}} \equiv \Delta p^{\bar{x}} \Delta p^{\bar{y}} \Delta p^{\bar{z}}Vp¯Δpx¯Δpy¯Δpz¯ centered on John's momentum, which is P x ¯ = P y ¯ = P x ¯ = P y ¯ = P^( bar(x))=P^( bar(y))=P^{\bar{x}}=P^{\bar{y}}=Px¯=Py¯= P z ¯ = 0 P z ¯ = 0 P^( bar(z))=0P^{\bar{z}}=0Pz¯=0.Focus attention on all particles whose world lines pass through V x ¯ V x ¯ V_( bar(x))\mathscr{V}_{\bar{x}}Vx¯ and which have momenta p j ¯ p j ¯ p^( bar(j))p^{\bar{j}}pj¯ in the range V p ¯ V p ¯ V_( bar(p))\mathscr{V}_{\bar{p}}Vp¯ surrounding P j ¯ = 0 P j ¯ = 0 P^( bar(j))=0P^{\bar{j}}=0Pj¯=0.
Examine this bundle in another local Lorentz frame ("unbarred frame", S S SSS ) at P 0 P 0 P_(0)\mathscr{P}_{0}P0, which moves with speed β β beta\betaβ relative to the rest frame. Orient axes so the relative motion of the frames is in the x x xxx and x ¯ x ¯ bar(x)\bar{x}x¯ directions. Then the space volume V x V x V_(x)\mathscr{V}_{x}Vx occupied in the new frame has Δ y = Δ y ¯ , Δ z = Δ z ¯ Δ y = Δ y ¯ , Δ z = Δ z ¯ Delta y=Delta bar(y),Delta z=Delta bar(z)\Delta y=\Delta \bar{y}, \Delta z=\Delta \bar{z}Δy=Δy¯,Δz=Δz¯ (no effect of motion on transverse directions), and Δ x = ( 1 β 2 ) 1 / 2 Δ x ¯ Δ x = 1 β 2 1 / 2 Δ x ¯ Delta x=(1-beta^(2))^(1//2)Delta bar(x)\Delta x=\left(1-\beta^{2}\right)^{1 / 2} \Delta \bar{x}Δx=(1β2)1/2Δx¯ (Lorentz contraction in longitudinal direction). Hence V x = ( 1 β 2 ) 1 / 2 V x ¯ V x = 1 β 2 1 / 2 V x ¯ V_(x)=(1-beta^(2))^(1//2)V_( bar(x))\mathscr{V}_{x}=\left(1-\beta^{2}\right)^{1 / 2} \mathscr{V}_{\bar{x}}Vx=(1β2)1/2Vx¯ ("transformation law for space volumes") or, equivalently [since P 0 = m / ( 1 β 2 ) 1 / 2 ] P 0 = m / 1 β 2 1 / 2 {:P^(0)=m//(1-beta^(2))^(1//2)]\left.P^{0}=m /\left(1-\beta^{2}\right)^{1 / 2}\right]P0=m/(1β2)1/2] :
P 0 V x = m V x ¯ = ( constant, independent of Lorentz frame ) P 0 V x = m V x ¯ = (  constant, independent   of Lorentz frame  ) P^(0)V_(x)=mV_( bar(x))=((" constant, independent ")/(" of Lorentz frame "))P^{0} \mathscr{V}_{x}=m \mathscr{V}_{\bar{x}}=\binom{\text { constant, independent }}{\text { of Lorentz frame }}P0Vx=mVx¯=( constant, independent  of Lorentz frame )
A momentum-space diagram, analogous to the spacetime diagram, depicts the momentum spread for particles in the bundle, and shows that Δ p x = Δ p x = Deltap^(x)=\Delta p^{x}=Δpx= Δ p x ¯ / ( 1 β 2 ) 1 / 2 Δ p x ¯ / 1 β 2 1 / 2 Deltap^( bar(x))//(1-beta^(2))^(1//2)\Delta p^{\bar{x}} /\left(1-\beta^{2}\right)^{1 / 2}Δpx¯/(1β2)1/2. The Lorentz transformation from S ¯ S ¯ bar(S)\bar{S}S¯ to S S SSS leaves transverse components of momenta unaffected; so Δ p y = Δ p y ¯ , Δ p z = Δ p z ¯ Δ p y = Δ p y ¯ , Δ p z = Δ p z ¯ Deltap^(y)=Deltap^( bar(y)),Deltap^(z)=Deltap^( bar(z))\Delta p^{y}=\Delta p^{\bar{y}}, \Delta p^{z}=\Delta p^{\bar{z}}Δpy=Δpy¯,Δpz=Δpz¯. Hence V p = V p ¯ / ( 1 β 2 ) 1 / 2 V p = V p ¯ / 1 β 2 1 / 2 V_(p)=V_( bar(p))//(1-beta^(2))^(1//2)\mathscr{V}_{p}=\mathscr{V}_{\bar{p}} /\left(1-\beta^{2}\right)^{1 / 2}Vp=Vp¯/(1β2)1/2 ("transformation law for momentum volumes"); or, equivalently
V p P 0 = V p ¯ m = ( constant, independent of Lorentz frame ) V p P 0 = V p ¯ m = (  constant, independent   of Lorentz frame  ) (V_(p))/(P^(0))=(V_( bar(p)))/(m)=((" constant, independent ")/(" of Lorentz frame "))\frac{\mathscr{V}_{p}}{P^{0}}=\frac{\mathscr{V}_{\bar{p}}}{m}=\binom{\text { constant, independent }}{\text { of Lorentz frame }}VpP0=Vp¯m=( constant, independent  of Lorentz frame )
Although the spatial 3 -volumes V x V x V_(x)\mathscr{V}_{x}Vx and V x ¯ V x ¯ V_( bar(x))\mathscr{V}_{\bar{x}}Vx¯ differ from one frame to another, and the momentum 3-volumes V p V p V_(p)\mathscr{V}_{p}Vp and V p ¯ V p ¯ V_( bar(p))\mathscr{V}_{\bar{p}}Vp¯ differ, the volume in six-dimensional phase space is Lorentz-invariant:
V V x ¯ T p ¯ = V x V p V V x ¯ T p ¯ = V x V p V-=V_( bar(x))T_( bar(p))=V_(x)V_(p)\mathscr{V} \equiv \mathscr{V}_{\bar{x}} \mathscr{T}_{\bar{p}}=\mathscr{V}_{x} \mathscr{V}_{p}VVx¯Tp¯=VxVp
It is a frame-independent, geometric object!

B. For Swarm of Identical Particles with Zero Rest Mass

Examine a sequence of systems, each with particles of smaller rest mass and of higher velocity relative to a laboratory. For every bundle of particles in each system, P 0 V x , V p / P 0 P 0 V x , V p / P 0 P^(0)V_(x),V_(p)//P^(0)P^{0} \mathscr{V}_{x}, \mathscr{V}_{p} / P^{0}P0Vx,Vp/P0, and V x V p V x V p V_(x)V_(p)\mathscr{V}_{x} \mathscr{V}_{p}VxVp are Lorentzinvariant. Hence, in the limit as m 0 m 0 m longrightarrow0m \longrightarrow 0m0, as β 1 β 1 beta longrightarrow1\beta \longrightarrow 1β1, and as P 0 = m / ( 1 β 2 ) 1 / 2 P 0 = m / 1 β 2 1 / 2 P^(0)=m//(1-beta^(2))^(1//2)longrightarrowP^{0}=m /\left(1-\beta^{2}\right)^{1 / 2} \longrightarrowP0=m/(1β2)1/2 finite value (particles of zero rest mass moving with speed of light), P 0 V x P 0 V x P^(0)V_(x)P^{0} \mathscr{V}_{x}P0Vx and V p / P 0 V p / P 0 V_(p)//P^(0)\mathscr{V}_{p} / P^{0}Vp/P0 and V x V p V x V p V_(x)V_(p)\mathscr{V}_{x} \mathscr{V}_{p}VxVp are still Lorentz-invariant, geometric quantities.

Box 22.6 CONSERVATION OF VOLUME IN PHASE SPACE

Examine a very small bundle of identical particles that move through curved spacetime on neighboring geodesics. Measure the bundle's volume in phase space, V ( V = V x V p V V = V x V p V(V=V_(x)V_(p):}\mathscr{V}\left(\mathscr{V}=\mathscr{V}_{x} \mathscr{V}_{p}\right.V(V=VxVp in any local Lorentz frame), as a function of affine parameter λ λ lambda\lambdaλ along the central geodesic of the bundle. The following calculation shows that
d V / d λ = 0 ( "Liouville theorem in curved spacetime" ) . d V / d λ = 0 (  "Liouville theorem in   curved spacetime"  ) . dV//d lambda=0quad((" "Liouville theorem in ")/(" curved spacetime" ")).d \mathscr{V} / d \lambda=0 \quad\binom{\text { "Liouville theorem in }}{\text { curved spacetime" }} .dV/dλ=0( "Liouville theorem in  curved spacetime" ).
Proof for particles of finite rest mass: Examine particle motion during time interval δ τ δ τ delta tau\delta \tauδτ, using local Lorentz rest frame of central particle. All velocities are small in this frame, so
p j ¯ = m d x j ¯ / d t ¯ p j ¯ = m d x j ¯ / d t ¯ p^( bar(j))=mdx^( bar(j))//d bar(t)p^{\bar{j}}=m d x^{\bar{j}} / d \bar{t}pj¯=mdxj¯/dt¯
Hence (see pictures) the spreads in momentum and position conserve Δ x ¯ Δ p x ¯ , Δ y ¯ Δ p y ¯ Δ x ¯ Δ p x ¯ , Δ y ¯ Δ p y ¯ Delta bar(x)Deltap^( bar(x)),Delta bar(y)Deltap^( bar(y))\Delta \bar{x} \Delta p^{\bar{x}}, \Delta \bar{y} \Delta p^{\bar{y}}Δx¯Δpx¯,Δy¯Δpy¯, and Δ z ¯ Δ p z ¯ Δ z ¯ Δ p z ¯ Delta bar(z)Deltap^( bar(z))\Delta \bar{z} \Delta p^{\bar{z}}Δz¯Δpz¯; i.e.,
d V d τ = δ ( Δ x ¯ Δ y ¯ Δ z ¯ Δ p x ¯ Δ p y ¯ Δ p z ¯ ) δ t ¯ = 0 d V d τ = δ Δ x ¯ Δ y ¯ Δ z ¯ Δ p x ¯ Δ p y ¯ Δ p z ¯ δ t ¯ = 0 (dV)/(d tau)=(delta(Delta( bar(x))Delta( bar(y))Delta( bar(z))Deltap^( bar(x))Deltap^( bar(y))Deltap^( bar(z))))/(delta( bar(t)))=0\frac{d \mathscr{V}}{d \tau}=\frac{\delta\left(\Delta \bar{x} \Delta \bar{y} \Delta \bar{z} \Delta p^{\bar{x}} \Delta p^{\bar{y}} \Delta p^{\bar{z}}\right)}{\delta \bar{t}}=0dVdτ=δ(Δx¯Δy¯Δz¯Δpx¯Δpy¯Δpz¯)δt¯=0
But τ = a λ + b τ = a λ + b tau=a lambda+b\tau=a \lambda+bτ=aλ+b for some arbitrary constants a a aaa and b b bbb; so d V / d λ = 0 d V / d λ = 0 dV//d lambda=0d \mathscr{V} / d \lambda=0dV/dλ=0.
Proof for particles of zero rest mass. Examine particle motion in local Lorentz frame where central particle has P = P 0 ( e 0 + e x ) P = P 0 e 0 + e x P=P^(0)(e_(0)+e_(x))\boldsymbol{P}=P^{0}\left(\boldsymbol{e}_{0}+\boldsymbol{e}_{x}\right)P=P0(e0+ex). In this frame, all particles have p y p 0 , p z p 0 , p x = p 0 + p y p 0 , p z p 0 , p x = p 0 + p^(y)≪p^(0),p^(z)≪p^(0),quadp^(x)=p^(0)+p^{y} \ll p^{0}, p^{z} \ll p^{0}, \quad p^{x}=p^{0}+pyp0,pzp0,px=p0+ O ( [ p y ] 2 / P 0 ) P 0 O p y 2 / P 0 P 0 O([p^(y)]^(2)//P^(0))~~P^(0)O\left(\left[p^{y}\right]^{2} / P^{0}\right) \approx P^{0}O([py]2/P0)P0. Since p α = d x α / d λ p α = d x α / d λ p^(alpha)=dx^(alpha)//d lambdap^{\alpha}=d x^{\alpha} / d \lambdapα=dxα/dλ for appropriate normalization of affine parameters (see Box 22.4), one can write d x j / d t = p j / p 0 d x j / d t = p j / p 0 dx^(j)//dt=p^(j)//p^(0)d x^{j} / d t=p^{j} / p^{0}dxj/dt=pj/p0; i.e.,
d x d t = 1 + O ( [ p y / P 0 ] 2 + [ p z / P 0 ] 2 ) 1 d y d t = p y P 0 , d z d t = p z P 0 d x d t = 1 + O p y / P 0 2 + p z / P 0 2 1 d y d t = p y P 0 , d z d t = p z P 0 {:[(dx)/(dt)=1+O([p^(y)//P^(0)]^(2)+[p^(z)//P^(0)]^(2))],[~~1],[(dy)/(dt)=(p^(y))/(P^(0))","quad(dz)/(dt)=(p^(z))/(P^(0))]:}\begin{aligned} \frac{d x}{d t} & =1+O\left(\left[p^{y} / P^{0}\right]^{2}+\left[p^{z} / P^{0}\right]^{2}\right) \\ & \approx 1 \\ \frac{d y}{d t} & =\frac{p^{y}}{P^{0}}, \quad \frac{d z}{d t}=\frac{p^{z}}{P^{0}} \end{aligned}dxdt=1+O([py/P0]2+[pz/P0]2)1dydt=pyP0,dzdt=pzP0

t ¯ = 0 t ¯ = 0 bar(t)=0\bar{t}=0t¯=0

t ¯ = δ t ¯ t ¯ = δ t ¯ bar(t)=delta bar(t)\bar{t}=\delta \bar{t}t¯=δt¯
Each particle moves with speed d x ¯ / d t ¯ d x ¯ / d t ¯ d bar(x)//d bar(t)d \bar{x} / d \bar{t}dx¯/dt¯ proportional to height in diagram
d x ¯ / d t ¯ = p x ¯ / m d x ¯ / d t ¯ = p x ¯ / m d bar(x)//d bar(t)=p^( bar(x))//md \bar{x} / d \bar{t}=p^{\bar{x}} / mdx¯/dt¯=px¯/m
and conserves its momentum, d p r ¯ / d t ¯ = 0 d p r ¯ / d t ¯ = 0 dp^( bar(r))//d bar(t)=0d p^{\bar{r}} / d \bar{t}=0dpr¯/dt¯=0. Hence the region occupied by particles deforms, but maintains its area. Same is true for ( y p y ) y p y (y-p^(y))\left(y-p^{y}\right)(ypy) and ( z p z ) z p z (z-p^(z))\left(z-p^{z}\right)(zpz).
Each particle ("photon") moves with d x / d t = 1 d x / d t = 1 dx//dt=1d x / d t=1dx/dt=1 and d p x / d t = 0 d p x / d t = 0 dp^(x)//dt=0d p^{x} / d t=0dpx/dt=0 in the local Lorentz frame. Area and shape of occupied region are preserved.
Hence (see pictures) Δ x Δ p x , Δ y Δ p y Δ x Δ p x , Δ y Δ p y Delta x Deltap^(x),Delta y Deltap^(y)\Delta x \Delta p^{x}, \Delta y \Delta p^{y}ΔxΔpx,ΔyΔpy, and Δ z Δ p z Δ z Δ p z Delta z Deltap^(z)\Delta z \Delta p^{z}ΔzΔpz are all conserved; and
d V d t = δ ( Δ x Δ y Δ z Δ p x Δ p y Δ p z ) δ t = 0 . d V d t = δ Δ x Δ y Δ z Δ p x Δ p y Δ p z δ t = 0 . (dV)/(dt)=(delta(Delta x Delta y Delta z Deltap^(x)Deltap^(y)Deltap^(z)))/(delta t)=0.\frac{d \mathscr{V}}{d t}=\frac{\delta\left(\Delta x \Delta y \Delta z \Delta p^{x} \Delta p^{y} \Delta p^{z}\right)}{\delta t}=0 .dVdt=δ(ΔxΔyΔzΔpxΔpyΔpz)δt=0.
But t t ttt and the affine parameter λ λ lambda\lambdaλ of central particle are related by t = P 0 λ t = P 0 λ t=P^(0)lambdat=P^{0} \lambdat=P0λ [cf. equation (16.4)]; thus
d V / d λ = 0 d V / d λ = 0 dV//d lambda=0d \mathscr{V} / d \lambda=0dV/dλ=0
Particle ("photon") speeds are proportional to height in diagram
d y / d t = p y / P 0 d y / d t = p y / P 0 dy//dt=p^(y)//P^(0)d y / d t=p^{y} / P^{0}dy/dt=py/P0
and d p y / d t = 0 d p y / d t = 0 dp^(y)//dt=0d p^{y} / d t=0dpy/dt=0. Hence, occupied region deforms but maintains its area. Same is true of z p z z p z z-p^(z)z-p^{z}zpz.
p ( λ ) p ( λ ) p(lambda)\boldsymbol{p}(\lambda)p(λ). Examine the density in phase space in this particle's neighborhood at each point along its world line:
X = R [ P ( λ ) , p ( λ ) ] . X = R [ P ( λ ) , p ( λ ) ] . X=R[P(lambda),p(lambda)].\mathscr{X}=\mathscr{R}[\mathscr{P}(\lambda), \boldsymbol{p}(\lambda)] .X=R[P(λ),p(λ)].
Calculate T ( λ ) T ( λ ) T(lambda)\mathscr{T}(\lambda)T(λ) as follows: (1) Pick an initial event P ( 0 ) P ( 0 ) P(0)\mathscr{P}(0)P(0) on the world line, and a phase-space volume V V V\mathscr{V}V containing the particle. (2) Cover with red paint all the particles contained in V V V\mathscr{V}V at P ( 0 ) P ( 0 ) P(0)\mathscr{P}(0)P(0). (3) Watch the red particles move through spacetime alongside the initial particle. (4) As they move, the phase-space region they occupy changes shape extensively; but its volume V V V\mathscr{V}V remains fixed (Liouville's theorem). Moreover, no particles can enter or leave that phase-space region (once in, always in; once out, always out; boundaries of phase-space region are attached to and move with the particles). (5) Hence, at any λ λ lambda\lambdaλ along the initial particle's world line, the particle is in a phase-space region of unchanged volume V V V\mathscr{V}V, unchanged number of particles N N NNN, and unchanged ratio r = N / V r = N / V r=N//V\mathscr{\mathscr { r }}=N / \mathscr{V}r=N/V :
(22.47) d R [ P ( λ ) , p ( λ ) ] d λ = 0 . (22.47) d R [ P ( λ ) , p ( λ ) ] d λ = 0 . {:(22.47)(dR[P(lambda),p(lambda)])/(d lambda)=0.:}\begin{equation*} \frac{d \mathscr{R}[\mathscr{P}(\lambda), \boldsymbol{p}(\lambda)]}{d \lambda}=0 . \tag{22.47} \end{equation*}(22.47)dR[P(λ),p(λ)]dλ=0.
Collisionless Boltzmann equation (kinetic equation)
This equation for the conservation of Z Z Z\mathscr{\mathscr { Z }}Z along a particle's trajectory in phase space is called the "collisionless Boltzmann equation," or the "kinetic equation."
Photons provide an important application of the Boltzmann equation. But when discussing photons one usually does not think in terms of the number density in phase space. Rather, one speaks of the "specific intensity" I ν I ν I_(nu)I_{\nu}Iν of radiation at a given frequency ν ν nu\nuν, flowing in a given direction, n n n\boldsymbol{n}n, as measured in a specified local Lorentz frame:
(22.48) I ν d ( energy ) d ( time ) d ( area ) d ( frequency ) d ( solid angle ) (22.48) I ν d (  energy  ) d (  time  ) d (  area  ) d (  frequency  ) d (  solid angle  ) {:(22.48)I_(nu)-=(d(" energy "))/(d(" time ")d(" area ")d(" frequency ")d(" solid angle ")):}\begin{equation*} I_{\nu} \equiv \frac{d(\text { energy })}{d(\text { time }) d(\text { area }) d(\text { frequency }) d(\text { solid angle })} \tag{22.48} \end{equation*}(22.48)Iνd( energy )d( time )d( area )d( frequency )d( solid angle )
Distribution function for photons expressed in terms of specific intensity, I p I p I_(p)I_{p}Ip
Invariance and conservation of I p / ν 3 I p / ν 3 I_(p)//nu^(3)I_{p} / \nu^{3}Ip/ν3
(See Figure 22.2). A simple calculation in the local Lorentz frame reveals that
(22.49) ϰ = h 4 ( I ν / ν 3 ) , (22.49) ϰ = h 4 I ν / ν 3 , {:(22.49)ϰ=h^(-4)(I_(nu)//nu^(3))",":}\begin{equation*} \mathscr{\varkappa}=h^{-4}\left(I_{\nu} / \nu^{3}\right), \tag{22.49} \end{equation*}(22.49)ϰ=h4(Iν/ν3),
where h h hhh is Planck's constant (see Figure 22.2). Thus, if two different observers at the same or different events in spacetime look at the same photon (and neighboring photons) as it passes them, they will see different frequencies ν ν nu\nuν ("doppler shift," "cosmological red shift," "gravitational redshift"), and different specific intensities I ν I ν I_(nu)I_{\nu}Iν; but they will obtain identical values for the ratio I ν / ν 3 I ν / ν 3 I_(nu)//nu^(3)I_{\nu} / \nu^{3}Iν/ν3. Thus I p / ν 3 I p / ν 3 I_(p)//nu^(3)I_{p} / \nu^{3}Ip/ν3, like R R R\mathscr{R}R, is invariant from observer to observer and from event to event along a given photon's world line.

EXERCISES

Exercise 22.15. INVERSE SQUARE LAW FOR FLUX

The specific flux of radiation entering a telescope from a given source is defined by
(22.50) F v = I v d Ω (22.50) F v = I v d Ω {:(22.50)F_(v)=intI_(v)d Omega:}\begin{equation*} F_{v}=\int I_{v} d \Omega \tag{22.50} \end{equation*}(22.50)Fv=IvdΩ
where integration is over the total solid angle (assumed 4 π 4 π ≪4pi\ll 4 \pi4π ) subtended by the source on the observer's sky. Use the Boltzmann equation (conservation of I ν / ν 3 I ν / ν 3 I_(nu)//nu^(3)I_{\nu} / \nu^{3}Iν/ν3 ) to show that F v ( distance from source ) 2 F v (  distance from source  ) 2 F_(v)prop(" distance from source ")^(-2)F_{v} \propto(\text { distance from source })^{-2}Fv( distance from source )2 for observers who are all at rest relative to each other in flat spacetime.

Exercise 22.16. BRIGHTNESS OF THE SUN

Does the surface of the sun look any brighter to an astronaut standing on Mercury than to a student standing on Earth?

Exercise 22.17. BLACK BODY RADIATION

An "optically thick" source of black-body radiation (e.g., the surface of a star, or the hot matter filling the universe shortly after the big bang) emits photons isotropically with a specific intensity, as seen by an observer at rest near the source, given (Planck radiation law) by
(22.51) I ν = 2 h v 3 e h ν / k T 1 (22.51) I ν = 2 h v 3 e h ν / k T 1 {:(22.51)I_(nu)=(2hv^(3))/(e^(h nu//kT)-1):}\begin{equation*} I_{\nu}=\frac{2 h v^{3}}{e^{h \nu / k T}-1} \tag{22.51} \end{equation*}(22.51)Iν=2hv3ehν/kT1
Here T T TTT is the temperature of the source. Show that any observer, in any local Lorentz frame, anywhere in the universe, who examines this radiation as it flows past him, will also see a black-body spectrum. Show, further, that if he calculates a temperature by measuring the specific intensity I ν I ν I_(nu)I_{\nu}Iν at any one frequency, and if he calculates a temperature from the shape of the spectrum, those temperatures will agree. (Radiation remains black body rather than being "diluted" into "grey-body.") Finally, show that the temperature he measures is redshifted by precisely the same factor as the frequency of any given photon is redshifted,
(22.52) T observed T emitted = ( v observed v emitted ) for a given photon. (22.52) T observed  T emitted  = v observed  v emitted   for a given photon.  {:(22.52)(T_("observed "))/(T_("emitted "))=((v_("observed "))/(v_("emitted ")))" for a given photon. ":}\begin{equation*} \frac{T_{\text {observed }}}{T_{\text {emitted }}}=\left(\frac{v_{\text {observed }}}{v_{\text {emitted }}}\right) \text { for a given photon. } \tag{22.52} \end{equation*}(22.52)Tobserved Temitted =(vobserved vemitted ) for a given photon. 
[Note that the redshifts can be "Doppler" in origin, "cosmological" in origin, "gravitational" in origin, or some inseparable mixture. All that matters is the fact that the parallel-transport law for a photon's 4-momentum, p p = 0 p p = 0 grad_(p)p=0\boldsymbol{\nabla}_{\boldsymbol{p}} \boldsymbol{p}=0pp=0, guarantees that the redshift ν observed / v emitted ν observed  / v emitted  nu_("observed ")//v_("emitted ")\nu_{\text {observed }} / v_{\text {emitted }}νobserved /vemitted  is independent of frequency emitted.]
3 -momentum volume, with direction of momentum vectors reversed for ease of visualization (telescope as an emitter, not a receiver!)
Figure 22.2.
Number density in phase space for photons, interpreted in terms of the specific intensity I y I y I_(y)I_{y}Iy. An astronomer has a telescope with filter that admits only photons arriving from within a small solid angle Δ Ω Δ Ω Delta Omega\Delta \OmegaΔΩ about the z z zzz-direction, and having energies between p 0 p 0 p^(0)p^{0}p0 and p 0 + Δ p 0 p 0 + Δ p 0 p^(0)+Deltap^(0)p^{0}+\Delta p^{0}p0+Δp0. The collecting area, a a aaa, of his telescope lies in the x , y x , y x,yx, yx,y-plane (perpendicular to the incoming photon beam). Let δ N δ N delta N\delta NδN be the number of photons that cross the area a a a\mathscr{a}a in a time interval δ t δ t delta t\delta tδt. [All energies, areas, times, and lengths are measured in the orthonormal frame ("proper reference frame; §13.6) which the astronomer Fermi-Walker transports with himself along his (possibly accelerated) world line-or, equivalently, in a local Lorentz frame momentarily at rest with respect to the astronomer.] The δ N δ N delta N\delta NδN photons, just before the time interval δ t δ t delta t\delta tδt begins, lie in the cylinder of area a a aaa and height δ z = δ t δ z = δ t delta z=delta t\delta z=\delta tδz=δt shown above. Their spatial 3 -volume is thus V x = a δ t V x = a δ t V_(x)=a delta t\mathscr{V}_{x}=a \delta tVx=aδt. Their momentum 3-volume is V p = ( p 0 ) 2 Δ p 0 Δ Ω V p = p 0 2 Δ p 0 Δ Ω V_(p)=(p^(0))^(2)Deltap^(0)Delta Omega\mathscr{V}_{p}=\left(p^{0}\right)^{2} \Delta p^{0} \Delta \OmegaVp=(p0)2Δp0ΔΩ (see drawing). Hence, their number density in phase space is
R = δ N V x V ~ p = δ N a δ t ( p 0 ) 2 ( Δ p 0 ) Δ Ω = δ N h 3 G δ t v 2 Δ v Δ Ω R = δ N V x V ~ p = δ N a δ t p 0 2 Δ p 0 Δ Ω = δ N h 3 G δ t v 2 Δ v Δ Ω R=(delta N)/(V_(x) widetilde(V)_(p))=(delta N)/(a quad delta t(p^(0))^(2)(Deltap^(0))Delta Omega)=(delta N)/(h^(3)G delta tv^(2)Delta v Delta Omega)\mathscr{R}=\frac{\delta N}{\mathscr{V}_{x} \widetilde{V}_{p}}=\frac{\delta N}{a \quad \delta t\left(p^{0}\right)^{2}\left(\Delta p^{0}\right) \Delta \Omega}=\frac{\delta N}{h^{3} G \delta t v^{2} \Delta v \Delta \Omega}R=δNVxV~p=δNaδt(p0)2(Δp0)ΔΩ=δNh3Gδtv2ΔvΔΩ
where ν ν nu\nuν is the photon frequency measured by the telescope ( p 0 = h ν p 0 = h ν p^(0)=h nup^{0}=h \nup0=hν ).
The specific intensity of the photons, I p I p I_(p)I_{p}Ip (a standard concept in astronomy), is the energy per unit area per unit time per unit frequency per unit solid angle crossing a surface perpendicular to the beam: i.e.,
I ν = h v δ N a δ t Δ v Δ Ω I ν = h v δ N a δ t Δ v Δ Ω I_(nu)=(hv delta N)/(a delta t Delta v Delta Omega)I_{\nu}=\frac{h v \delta N}{a \delta t \Delta v \Delta \Omega}Iν=hvδNaδtΔvΔΩ
Direct comparison reveals π = h 4 ( I ν / ν 3 ) π = h 4 I ν / ν 3 pi=h^(-4)(I_(nu)//nu^(3))\mathscr{\pi}=h^{-4}\left(I_{\nu} / \nu^{3}\right)π=h4(Iν/ν3).
Thus, conservation of r r r\mathscr{\mathscr { r }}r along a photon's world line implies conservation of I y / ν 3 I y / ν 3 I_(y)//nu^(3)I_{y} / \nu^{3}Iy/ν3. This conservation law finds important applications in cosmology (e.g., Box 29.2 and Ex. 29.5) and in the gravitational lens effect (Refsdal 1964); see also exercises 22.15-22.17.

Exercise 22.18. STRESS-ENERGY TENSOR

(a) Show that the stress-energy tensor for a swarm of identical particles at an event P 0 P 0 P_(0)\mathscr{P}_{0}P0 can be written as an integral over the mass hyperboloid of the momentum space at P 0 P 0 P_(0)\mathscr{P}_{0}P0 :
(22.53) T = ( T p p ) ( d V p / p 0 ) (22.54) d V p p 0 d p x d p y d p z p 0 in a local Lorentz frame. (22.53) T = ( T p p ) d V p / p 0 (22.54) d V p p 0 d p x d p y d p z p 0  in a local Lorentz frame.  {:[(22.53)T=int(Tp ox p)(dV_(p)//p^(0))],[(22.54)(dV_(p))/(p^(0))-=(dp^(x)dp^(y)dp^(z))/(p^(0))" in a local Lorentz frame. "]:}\begin{gather*} \boldsymbol{T}=\int(\mathscr{T} \boldsymbol{p} \otimes \boldsymbol{p})\left(d \mathscr{V}_{p} / p^{0}\right) \tag{22.53}\\ \frac{d \mathscr{V}_{p}}{p^{0}} \equiv \frac{d p^{x} d p^{y} d p^{z}}{p^{0}} \text { in a local Lorentz frame. } \tag{22.54} \end{gather*}(22.53)T=(Tpp)(dVp/p0)(22.54)dVpp0dpxdpydpzp0 in a local Lorentz frame. 
(Notice from Box 22.5 that d V p / p 0 d V p / p 0 dV_(p)//p^(0)d \mathscr{V}_{p} / p^{0}dVp/p0 is a Lorentz-invariant volume element for any segment of the mass hyperboloid.)
(b) Verify that the Boltzmann equation, d π / d λ = 0 d π / d λ = 0 dpi//d lambda=0d \mathscr{\pi} / d \lambda=0dπ/dλ=0, implies T = 0 T = 0 grad*T=0\boldsymbol{\nabla} \cdot \boldsymbol{T}=0T=0 for any swarm of identical particles. [Hint: Calculate T T grad*T\boldsymbol{\nabla} \cdot \boldsymbol{T}T in a local Lorentz frame, using the above expression for T T T\boldsymbol{T}T, and using the geodesic equation in the form D p μ / d λ = 0 D p μ / d λ = 0 Dp^(mu)//d lambda=0D p^{\mu} / d \lambda=0Dpμ/dλ=0.]

Exercise 22.19. KINETIC THEORY FOR NONIDENTICAL PARTICLES

For a swarm of particles with a wide distribution of rest masses, define
(22.55) R = Δ N V x V p Δ m , (22.55) R = Δ N V x V p Δ m , {:(22.55)R=(Delta N)/(V_(x)V_(p)Delta m)",":}\begin{equation*} \mathscr{R}=\frac{\Delta N}{\mathscr{V}_{x} \mathscr{V}_{p} \Delta m}, \tag{22.55} \end{equation*}(22.55)R=ΔNVxVpΔm,
where V x V x V_(x)\mathscr{V}_{x}Vx and V p V p V_(p)\mathscr{V}_{p}Vp are spatial and momentum 3-volumes, and Δ N Δ N Delta N\Delta NΔN is the number of particles in the region V x V p V x V p V_(x)V_(p)\mathscr{V}_{x} \mathscr{V}_{p}VxVp with rest masses between m Δ m / 2 m Δ m / 2 m-Delta m//2m-\Delta m / 2mΔm/2 and m + Δ m / 2 m + Δ m / 2 m+Delta m//2m+\Delta m / 2m+Δm/2. Show the following.
(a) T x V p Δ m T x V p Δ m T_(x)V_(p)Delta m\mathscr{T}_{x} \mathscr{V}_{p} \Delta mTxVpΔm is independent of Lorentz frame and independent of location on the world tube of a bundle of particles.
(b) R R R\mathscr{R}R can be regarded as a function of location P P P\mathscr{P}P in spacetime and 4-momentum p p p\boldsymbol{p}p inside the future light cone of the tangent space at P P P\mathscr{P}P :
(22.56) R = H ( P , p ) . (22.56) R = H ( P , p ) . {:(22.56)R=H(P","p).:}\begin{equation*} \mathscr{R}=\mathscr{H}(\mathscr{P}, \boldsymbol{p}) . \tag{22.56} \end{equation*}(22.56)R=H(P,p).
(c) π π pi\mathscr{\pi}π satisfies the collisionless Boltzmann equation (kinetic equation)
(22.57) d H [ P ( λ ) , p ( λ ) ] d λ = 0 along geodesic trajectory of any particle. (22.57) d H [ P ( λ ) , p ( λ ) ] d λ = 0  along geodesic trajectory of any particle.  {:(22.57)(dH[P(lambda),p(lambda)])/(d lambda)=0quad" along geodesic trajectory of any particle. ":}\begin{equation*} \frac{d \mathscr{\mathscr { H }}[\mathscr{P}(\lambda), \boldsymbol{p}(\lambda)]}{d \lambda}=0 \quad \text { along geodesic trajectory of any particle. } \tag{22.57} \end{equation*}(22.57)dH[P(λ),p(λ)]dλ=0 along geodesic trajectory of any particle. 
(d) R R R\mathscr{R}R can be rewritten in a local Lorentz frame as
(22.58) = Δ N [ ( p 0 / m ) Δ x Δ y Δ z ] [ Δ p 0 Δ p x Δ p y Δ p z ] (22.58) = Δ N p 0 / m Δ x Δ y Δ z Δ p 0 Δ p x Δ p y Δ p z {:(22.58)ℜ=(Delta N)/([(p^(0)//m)Delta x Delta y Delta z][Deltap^(0)Deltap^(x)Deltap^(y)Deltap^(z)]):}\begin{equation*} \Re=\frac{\Delta N}{\left[\left(p^{0} / m\right) \Delta x \Delta y \Delta z\right]\left[\Delta p^{0} \Delta p^{x} \Delta p^{y} \Delta p^{z}\right]} \tag{22.58} \end{equation*}(22.58)=ΔN[(p0/m)ΔxΔyΔz][Δp0ΔpxΔpyΔpz]
(e) The stress-energy tensor at an event P P P\mathscr{P}P can be written as an integral over the interior of the future light cone of momentum space
(22.59) T μ ν = ( p μ p ν ) m 1 d p 0 d p 1 d p 2 d p 3 (22.59) T μ ν = p μ p ν m 1 d p 0 d p 1 d p 2 d p 3 {:(22.59)T^(mu nu)=int(ℜp^(mu)p^(nu))m^(-1)dp^(0)dp^(1)dp^(2)dp^(3):}\begin{equation*} T^{\mu \nu}=\int\left(\Re p^{\mu} p^{\nu}\right) m^{-1} d p^{0} d p^{1} d p^{2} d p^{3} \tag{22.59} \end{equation*}(22.59)Tμν=(pμpν)m1dp0dp1dp2dp3
in a local Lorentz frame (Track-1 notation for integral; see Box 5.3);
T = ( τ p p ) m 1 1 in frame-independent notation ( ) = ( p p ) m 1 d p 0 d p 1 d p 2 d p 3 T = ( τ p p ) m 1 1  in frame-independent notation  ( ) = ( p p ) m 1 d p 0 d p 1 d p 2 d p 3 {:[T=int(tau^(⏜)p ox p)m^(-1**)1quad" in frame-independent notation "],[('")"=int(^(⏜)p ox p)m^(-1)dp^(0)^^dp^(1)^^dp^(2)^^dp^(3)]:}\begin{align*} \boldsymbol{T} & =\int(\overparen{\tau} \boldsymbol{p} \otimes \boldsymbol{p}) m^{-1 *} 1 \quad \text { in frame-independent notation } \\ & =\int(\overparen{ } \boldsymbol{p} \otimes \boldsymbol{p}) m^{-1} \boldsymbol{d} p^{0} \wedge \boldsymbol{d} p^{1} \wedge \boldsymbol{d} p^{2} \wedge \boldsymbol{d} p^{3} \tag{$\prime$} \end{align*}T=(τpp)m11 in frame-independent notation ()=(pp)m1dp0dp1dp2dp3
in a local Lorentz frame (Track-2 notation; see Box 5.4).

RELATIVISTIC STARS

Wherein the reader, armed with the magic potions and powers of Geometrodynamics, conquers the stars.

CHAPTER 3

SPHERICAL STARS

§23.1. PROLOG

Beautiful though gravitation theory may be, it is a sterile subject until it touches the real physical world. Only the hard reality of experiments and of astronomical observations can bring gravitation theory to life. And only by building theoretical models of stars (Part V), of the universe (Part VI), of stellar collapse and black holes (Part VII), of gravitational waves and their sources (Part VIII), and of gravitational experiments (Part IX), can one understand clearly the contacts between gravitation theory and reality.
The model-building in this book will follow the tradition of theoretical physics. Each Part (stars, universe, collapse, . . .) will begin with the most oversimplified model conceivable, and will subsequently add only those additional touches of realism necessary to make contact with the least complex of actual physical systems. The result will be a tested intellectual framework, ready to support and organize the additional complexities demanded by greater realism. Greater realism will not be attempted in this book. But the reader seeking it could start in no better place than the two-volume treatise on Relativistic Astrophysics by Zel'dovich and Novikov (1971, 1974).
Begin, now, with models for relativistic stars. As a major simplification, insist (initially) that all stars studied be static. Thereby exclude not only exploding and pulsating stars, but even quiescent ones with stationary rotational motions. From the static assumption, plus a demand that the star be made of "perfect fluid" (no shear stresses allowed!), plus Einstein's field equations, it probably follows that the star is spherically symmetric. However, nobody has yet given a proof. [For proofs under more restricted assumptions, see Avez (1964) and Kunzle (1971).] In the absence of a proof, assume the result: insist that all stars studied be spherical as well as static.
Preview of the rest of this book
Static stars must be spherical

§23.2. COORDINATES AND METRIC FOR A STATIC, SPHERICAL SYSTEM

Metric for any static, spherical system:
(1) generalized from flat spacetime
(2) specialized to
"Schwarzschild form"
To deduce the gravitational field for a static spherical star-or for any other static, spherical system-begin with the metric of special relativity (no gravity) in the spherically symmetric form
(23.1) d s 2 = d t 2 + d r 2 + r 2 d Ω 2 (23.1) d s 2 = d t 2 + d r 2 + r 2 d Ω 2 {:(23.1)ds^(2)=-dt^(2)+dr^(2)+r^(2)dOmega^(2):}\begin{equation*} d s^{2}=-d t^{2}+d r^{2}+r^{2} d \Omega^{2} \tag{23.1} \end{equation*}(23.1)ds2=dt2+dr2+r2dΩ2
where
(23.2) d Ω 2 = d θ 2 + sin 2 θ d ϕ 2 (23.2) d Ω 2 = d θ 2 + sin 2 θ d ϕ 2 {:(23.2)dOmega^(2)=dtheta^(2)+sin^(2)theta dphi^(2):}\begin{equation*} d \Omega^{2}=d \theta^{2}+\sin ^{2} \theta d \phi^{2} \tag{23.2} \end{equation*}(23.2)dΩ2=dθ2+sin2θdϕ2
Try to modify this metric to allow for curvature due to the gravitational influence of the star, while preserving spherical symmetry. The simplest and most obvious guess is to allow those metric components that are already non-zero in equation (23.1) to assume different values:
(23.3) d s 2 = e 2 ϕ d t 2 + e 2 Λ d r 2 + R 2 d Ω 2 (23.3) d s 2 = e 2 ϕ d t 2 + e 2 Λ d r 2 + R 2 d Ω 2 {:(23.3)ds^(2)=-e^(2phi)dt^(2)+e^(2Lambda)dr^(2)+R^(2)dOmega^(2):}\begin{equation*} d s^{2}=-e^{2 \phi} d t^{2}+e^{2 \Lambda} d r^{2}+R^{2} d \Omega^{2} \tag{23.3} \end{equation*}(23.3)ds2=e2ϕdt2+e2Λdr2+R2dΩ2
where Φ , Λ Φ , Λ Phi,Lambda\Phi, \LambdaΦ,Λ, and R R RRR are functions of r r rrr only. (The static assumption demands g μ ν / t = 0 g μ ν / t = 0 delg_(mu nu)//del t=0\partial g_{\mu \nu} / \partial t=0gμν/t=0.) To verify that this guess is good, use it in constructing stellar models, and check that the resulting models have the same generality (same set of quantities freely specifiable) as in Newtonian theory and as expected from general physical considerations. An apparently more general metric
(23.4) d s 2 = a 2 d t 2 2 a b d r d t + c 2 d r 2 + R 2 d Ω 2 (23.4) d s 2 = a 2 d t 2 2 a b d r d t + c 2 d r 2 + R 2 d Ω 2 {:(23.4)ds^(2)=-a^(2)dt^(2)-2abdrdt+c^(2)dr^(2)+R^(2)dOmega^(2):}\begin{equation*} d s^{2}=-a^{2} d t^{2}-2 a b d r d t+c^{2} d r^{2}+R^{2} d \Omega^{2} \tag{23.4} \end{equation*}(23.4)ds2=a2dt22abdrdt+c2dr2+R2dΩ2
actually is not more general in any physical sense. One can perform a coordinate transformation to a new time coordinate t t t^(')t^{\prime}t defined by
(23.5) e d t = a d t + b d r (23.5) e d t = a d t + b d r {:(23.5)e^(""⧸"")dt^(')=adt+bdr:}\begin{equation*} e^{\not} d t^{\prime}=a d t+b d r \tag{23.5} \end{equation*}(23.5)edt=adt+bdr
By inserting this in equation (23.4), and by defining e 2 A b 2 + c 2 e 2 A b 2 + c 2 e^(2A)-=b^(2)+c^(2)e^{2 A} \equiv b^{2}+c^{2}e2Ab2+c2, one obtains the postulated line element (23.3), apart from a prime on the t . t . t.^(**)t .{ }^{*}t.
The necessity to allow for arbitrary coordinates in general relativity may appear burdensome when one is formulating the theory; but it gives an added flexibility, something one should always try to turn to one's advantage when formulating and solving problems. The g r t = 0 g r t = 0 g_(rt)=0g_{r t}=0grt=0 simplification (called a coordinate condition) in equation (23.3) results from an advantageous choice of the t t ttt coordinate. The r r rrr coordinate, however, is also at one's disposal (as long as one chooses it in a way that respects spherical symmetry; thus not r = r + cos θ r = r + cos θ r^(')=r+cos thetar^{\prime}=r+\cos \thetar=r+cosθ ). One can turn this freedom to advantage by introducing a new coordinate r ( r ) r ( r ) r^(')(r)r^{\prime}(r)r(r) defined by
(23.6) r = R ( r ) (23.6) r = R ( r ) {:(23.6)r^(')=R(r):}\begin{equation*} r^{\prime}=R(r) \tag{23.6} \end{equation*}(23.6)r=R(r)
With this choice of the radial coordinate, and with the primes dropped, equation (23.3) reduces to
(23.7) d s 2 = e 2 Φ d t 2 + e 2 Λ d r 2 + r 2 d Ω 2 (23.7) d s 2 = e 2 Φ d t 2 + e 2 Λ d r 2 + r 2 d Ω 2 {:(23.7)ds^(2)=-e^(2Phi)dt^(2)+e^(2Lambda)dr^(2)+r^(2)dOmega^(2):}\begin{equation*} d s^{2}=-e^{2 \Phi} d t^{2}+e^{2 \Lambda} d r^{2}+r^{2} d \Omega^{2} \tag{23.7} \end{equation*}(23.7)ds2=e2Φdt2+e2Λdr2+r2dΩ2
a line element with just two unknown functions, Φ ( r ) Φ ( r ) Phi(r)\Phi(r)Φ(r) and Λ ( r ) Λ ( r ) Lambda(r)\Lambda(r)Λ(r). This coordinate system and metric have been used in most theoretical models for relativistic stars since the pioneering work of Schwarzschild (1916b), Tolman (1939), and Oppenheimer and Volkoff (1939). These particular coordinates are sometimes called "curvature coordinates" and sometimes "Schwarzschild coordinates." The central idea of these coordinates, in a nutshell, is (Schwarzschild r r rrr-coordinate) = = === (proper circumference) / 2 π / 2 π //2pi/ 2 \pi/2π.
For a more rigorous proof that in any static spherical system Schwarzschild coordinates can be introduced, bringing the metric into the simple form (23.7), see Box 23.3 at the end of this chapter.

Exercise 23.1. ISOTROPIC COORDINATES AND NEWTONIAN LIMIT

An alternative set of coordinates sometimes used for static, spherical systems is the "isotropic coordinate system" ( t , r ¯ , θ , ϕ ) ( t , r ¯ , θ , ϕ ) (t, bar(r),theta,phi)(t, \bar{r}, \theta, \phi)(t,r¯,θ,ϕ). The metric in isotropic coordinates has the form
(23.8) d s 2 = e 2 ϕ d t 2 + e 2 μ [ d r ¯ 2 + r ¯ 2 d Ω 2 ] (23.8) d s 2 = e 2 ϕ d t 2 + e 2 μ d r ¯ 2 + r ¯ 2 d Ω 2 {:(23.8)ds^(2)=-e^(2phi)dt^(2)+e^(2mu)[d bar(r)^(2)+ bar(r)^(2)dOmega^(2)]:}\begin{equation*} d s^{2}=-e^{2 \phi} d t^{2}+e^{2 \mu}\left[d \bar{r}^{2}+\bar{r}^{2} d \Omega^{2}\right] \tag{23.8} \end{equation*}(23.8)ds2=e2ϕdt2+e2μ[dr¯2+r¯2dΩ2]
with Φ Φ Phi\PhiΦ and μ μ mu\muμ being functions of r ¯ r ¯ bar(r)\bar{r}r¯.
(a) Exhibit the coordinate transformation connecting the Schwarzschild coordinates (23.7) to the isotropic coordinates (23.8).
(b) From equation (16.2a) [or equivalently (18.15c)], show that, in the Newtonian limit, the metric coefficient Φ Φ Phi\PhiΦ of the isotropic line element becomes the Newtonian potential; and μ μ mu\muμ becomes equal to Φ Φ -Phi-\PhiΦ. By combining with part (a), discover that Λ = r d Φ / d r Λ = r d Φ / d r Lambda=rd Phi//dr\Lambda=r d \Phi / d rΛ=rdΦ/dr in the Newtonian limit.

EXERCISE

§23.3. PHYSICAL INTERPRETATION OF SCHWARZSCHILD COORDINATES

In general relativity, because the use of arbitrary coordinates is permitted, the physical significance of statements about tensor or vector components and other quantities is not always obvious. There are, however, some situations where the interpretation is almost as straightforward as in special relativity. The most obvious example is the center point of a local inertial coordinate system, where the principle of equivalence allows one to treat all local quantities (quantities not involving spacetime curvature) exactly as in special relativity. Schwarzschild coordinates for a spherical system turn out to be a second example.
One's first reaction when meeting a new metric should be to examine it, not in order to learn about the gravitational field, for which the curvature tensor is more
The form of any metric can reveal the nature of the coordinates being used
Geometric significance of the Schwarzschild coordinates:
(1) θ , ϕ θ , ϕ theta,phi\theta, \phiθ,ϕ are angles on sphere
(2) r r rrr measures surface area of sphere
(3) t t ttt has 3 special geometric properties
(4) description of a "machine" to measure t t ttt
directly informative, but to learn about the coordinates. (Are they, for instance, locally inertial at some point?)
The names given to the coordinates have no intrinsic significance. A coordinate transformation t = θ , r = ϕ , θ = r , ϕ = t t = θ , r = ϕ , θ = r , ϕ = t t^(')=theta,r^(')=phi,theta^(')=r,phi^(')=tt^{\prime}=\theta, r^{\prime}=\phi, \theta^{\prime}=r, \phi^{\prime}=tt=θ,r=ϕ,θ=r,ϕ=t is perfectly permissible, and has no influence on the physics or the mathematics of a relativistic problem. The only thing it affects is easy communication between the investigator who adopts it and his colleagues. Thus the names tr θ ϕ tr θ ϕ tr theta phi\operatorname{tr} \theta \phitrθϕ for the Schwarzschild coordinates (23.7) provide a mnemonic device pointing out the geometric content of the coordinates.* In particular, the names θ , ϕ θ , ϕ theta,phi\theta, \phiθ,ϕ are justified by the fact that on each two-dimensional surface of constant r r rrr and t t ttt, the distance between two nearby events is given by d s 2 = r 2 d Ω 2 d s 2 = r 2 d Ω 2 ds^(2)=r^(2)dOmega^(2)d s^{2}=r^{2} d \Omega^{2}ds2=r2dΩ2, as befits standard θ , ϕ θ , ϕ theta,phi\theta, \phiθ,ϕ coordinates on a sphere of radius r r rrr. The area of this two-dimensional sphere is clearly
(23.9) A = ( r d θ ) ( r sin θ d ϕ ) = 4 π r 2 (23.9) A = ( r d θ ) ( r sin θ d ϕ ) = 4 π r 2 {:(23.9)A=int(rd theta)(r sin theta d phi)=4pir^(2):}\begin{equation*} A=\int(r d \theta)(r \sin \theta d \phi)=4 \pi r^{2} \tag{23.9} \end{equation*}(23.9)A=(rdθ)(rsinθdϕ)=4πr2
hence, the metric (23.7) tells how to measure the r r rrr coordinate that it employs. One can merely measure (in proper length units) the area A A AAA of the sphere, composed of all points rotationally equivalent to the point P P P\mathscr{P}P for which the value r ( P ) r ( P ) r(P)r(\mathscr{P})r(P) is desired; and one can then calculate
( ) r ( P ) = ( proper area of sphere through point P / 4 π ) 1 / 2 . ( ) r ( P ) =  proper area of sphere   through point  P / 4 π 1 / 2 . {:('")"r(P)=([" proper area of sphere "],[" through point "P]//4pi)^(1//2).:}r(\mathscr{P})=\left(\begin{array}{l} \text { proper area of sphere } \tag{$\prime$}\\ \text { through point } \mathscr{P} \end{array} / 4 \pi\right)^{1 / 2} .()r(P)=( proper area of sphere  through point P/4π)1/2.
The Schwarzschild coordinates have been picked for convenience, and not for the ease with which one could build a coordinate-measuring machine. This makes it more difficult to design a machine to measure t t ttt than machines to measure r , θ , ϕ r , θ , ϕ r,theta,phir, \theta, \phir,θ,ϕ.
The geometric properties of t t ttt on which a measuring device can be based are: (1) the time-independent distances ( g α β / t = 0 g α β / t = 0 delg_(alpha beta)//del t=0\partial g_{\alpha \beta} / \partial t=0gαβ/t=0 ) between world lines of constant r , θ , ϕ ; ( 2 ) r , θ , ϕ ; ( 2 ) r,theta,phi;(2)r, \theta, \phi ;(2)r,θ,ϕ;(2) the orthogonality ( g t r = g t θ = g t ϕ = 0 ) g t r = g t θ = g t ϕ = 0 (g_(tr)=g_(t theta)=g_(t phi)=0)\left(g_{t r}=g_{t \theta}=g_{t \phi}=0\right)(gtr=gtθ=gtϕ=0) of these world lines to the t = t = t=t=t= constant hypersurfaces; and (3) a labeling of these hypersurfaces by Minkowski (special relativistic) coordinate time at spatial infinity, where spacetime becomes flat. This labeling produces a constraint
(23.10) Φ ( ) = 0 (23.10) Φ ( ) = 0 {:(23.10)Phi(oo)=0:}\begin{equation*} \Phi(\infty)=0 \tag{23.10} \end{equation*}(23.10)Φ()=0
in the metric (23.7). [Mathematically, this constraint is imposed by a simple rescaling transformation t = e Φ ( x ) t t = e Φ ( x ) t t^(')=e^(Phi(x))tt^{\prime}=e^{\Phi(x)} tt=eΦ(x)t, and by then dropping the prime.]
One "machine" design which constructs (mentally) such a t t ttt coordinate, and in the process measures it, is the following. Observers using radar sets arrange to move along the coordinate lines r , θ , ϕ = r , θ , ϕ = r,theta,phi=r, \theta, \phi=r,θ,ϕ= const. They do this by adjusting their velocities until each finds that the radar echos from his neighbors, or from "benchmark" reference points in the asymptotically flat space, require the same round-trip time at each repetition. Equivalently, each returning echo must show zero doppler shift;
*For an example of misleading names, consider those in the equation d s 2 = e 2 ϕ ( θ ) d ϕ 2 + e 2 A ( θ ) d θ 2 + θ 2 ( d t 2 + sin 2 t d r 2 ) ,  *For an example of misleading names, consider those in the equation  d s 2 = e 2 ϕ ( θ ) d ϕ 2 + e 2 A θ d θ 2 + θ 2 d t 2 + sin 2 t d r 2 , {:[" *For an example of misleading names, consider those in the equation "],[qquad ds^(2)=-e^(2phi(theta))dphi^('2)+e^(2A(theta^(')))dtheta^('2)+theta^('2)(dt^('2)+sin^(2)t^(')dr^('2))","]:}\begin{aligned} & \text { *For an example of misleading names, consider those in the equation } \\ & \qquad d s^{2}=-e^{2 \phi(\theta)} d \phi^{\prime 2}+e^{2 A\left(\theta^{\prime}\right)} d \theta^{\prime 2}+\theta^{\prime 2}\left(d t^{\prime 2}+\sin ^{2} t^{\prime} d r^{\prime 2}\right), \end{aligned} *For an example of misleading names, consider those in the equation ds2=e2ϕ(θ)dϕ2+e2A(θ)dθ2+θ2(dt2+sin2tdr2),
which is equivalent to equation (23.7), but employs the coordinates t = θ , r = ϕ , θ = r , ϕ = t t = θ , r = ϕ , θ = r , ϕ = t t^(')=theta,r^(')=phi,theta^(')=r,phi^(')=tt^{\prime}=\theta, r^{\prime}=\phi, \theta^{\prime}=r, \phi^{\prime}=tt=θ,r=ϕ,θ=r,ϕ=t.
it must return with the same frequency at which it was sent out. Next a master clock is set up near spatial infinity (far from the star). It is constructed to measure proper time-which, for it, is Minkowski time "at infinity"-and to emit a standard oneHertz signal. Each observer adjusts the rate of his "coordinate clock" to beat in time with the signals he receives from the master clock. To set the zero of his "coordinate clock," now that its rate is correct, he synchronizes with the master clock, taking account of the coordinate time Δ t Δ t Delta t\Delta tΔt required for radar signals to travel from the master to him. [To compute the transit time, he assumes that for radar signals ( t reflection t reflection  t_("reflection ")-t_{\text {reflection }}-treflection  t emission ) = ( t return t reflection ) = Δ t t emission  = t return  t reflection  = Δ t {:t_("emission "))=(t_("return ")-t_("reflection "))=Delta t\left.t_{\text {emission }}\right)=\left(t_{\text {return }}-t_{\text {reflection }}\right)=\Delta ttemission )=(treturn treflection )=Δt, so that the echo is obtained by time-inversion about the reflection event. This time-reversal invariance distinguishes the time t t ttt in the metric (23.7) from the more general t t ttt coordinates allowed by equation (23.4).] Each observer moving along a coordinate line ( r , θ , ϕ = r , θ , ϕ = r,theta,phi=r, \theta, \phi=r,θ,ϕ= const.) now has a clock that measures coordinate time t t ttt in his neighborhood.
The above discussion identifies the Schwarzschild coordinates of equation (23.7) by their intrinsic geometric properties. Not only are r r rrr and t t ttt radial and time variables, respectively (in that / r / r del//del r\partial / \partial r/r and / t / t del//del t\partial / \partial t/t are spacelike and timelike, respectively, and are orthogonal also to the spheres defined by rotational symmetry), but they have particular properties [ 4 π r 2 = 4 π r 2 = [4pir^(2)=:}\left[4 \pi r^{2}=\right.[4πr2= surface area; g μ ν / t = 0 ; / r / t = g r t = 0 g μ ν / t = 0 ; / r / t = g r t = 0 delg_(mu nu)//del t=0;del//del r*del//del t=g_(rt)=0\partial g_{\mu \nu} / \partial t=0 ; \partial / \partial r \cdot \partial / \partial t=g_{r t}=0gμν/t=0;/r/t=grt=0; / t / t = g t t = 1 / t / t = g t t = 1 del//del t*del//del t=g_(tt)=-1\partial / \partial t \cdot \partial / \partial t=g_{t t}=-1/t/t=gtt=1 at r = r = r=oor=\inftyr= ] that distinguish them from other possible coordinate choices [ r = f ( r ) , t = t + F ( r ) ] r = f ( r ) , t = t + F ( r ) [r^(')=f(r),t^(')=t+F(r)]\left[r^{\prime}=f(r), t^{\prime}=t+F(r)\right][r=f(r),t=t+F(r)]. No claim is made that these are the only coordinates that might reasonably be called r r rrr and t t ttt; for an alternative choice ("isotropic coordinates"), see exercise 23.1. However, they provide a choice that is reasonable, vnambiguous, useful, and often used.

§23.4. DESCRIPTION OF THE MATTER INSIDE A STAR

To high precision, the matter inside any star is a perfect fluid. (Shear stresses are negligible, and energy transport is negligible on a "hydrodynamic time scale.") Thus, it is reasonable in model building to describe the matter by perfect-fluid parameters:
ρ = ρ ( r ) = density of mass-energy in rest-frame of fluid; p = p ( r ) = isotropic pressure in rest-frame of fluid; (23.11) n = n ( r ) = number density of baryons in rest-frame of fluid; u μ = u μ ( r ) = 4 -velocity of fluid; (23.12) T μ ν = ( ρ + p ) u μ u ν + p g μ ν = stress-energy tensor of fluid. ρ = ρ ( r ) =  density of mass-energy in rest-frame of fluid;  p = p ( r ) =  isotropic pressure in rest-frame of fluid;  (23.11) n = n ( r ) =  number density of baryons in rest-frame of fluid;  u μ = u μ ( r ) = 4 -velocity of fluid;  (23.12) T μ ν = ( ρ + p ) u μ u ν + p g μ ν =  stress-energy tensor of fluid.  {:[rho=rho(r)=" density of mass-energy in rest-frame of fluid; "],[p=p(r)=" isotropic pressure in rest-frame of fluid; "],[(23.11)n=n(r)=" number density of baryons in rest-frame of fluid; "],[u^(mu)=u^(mu)(r)=4"-velocity of fluid; "],[(23.12)T^(mu nu)=(rho+p)u^(mu)u^(nu)+pg^(mu nu)=" stress-energy tensor of fluid. "]:}\begin{align*} \rho & =\rho(r)=\text { density of mass-energy in rest-frame of fluid; } \\ p & =p(r)=\text { isotropic pressure in rest-frame of fluid; } \\ n & =n(r)=\text { number density of baryons in rest-frame of fluid; } \tag{23.11}\\ u^{\mu} & =u^{\mu}(r)=4 \text {-velocity of fluid; } \\ T^{\mu \nu} & =(\rho+p) u^{\mu} u^{\nu}+p g^{\mu \nu}=\text { stress-energy tensor of fluid. } \tag{23.12} \end{align*}ρ=ρ(r)= density of mass-energy in rest-frame of fluid; p=p(r)= isotropic pressure in rest-frame of fluid; (23.11)n=n(r)= number density of baryons in rest-frame of fluid; uμ=uμ(r)=4-velocity of fluid; (23.12)Tμν=(ρ+p)uμuν+pgμν= stress-energy tensor of fluid. 
(For Track-1 discussion, see Box 5.1; for greater Track-2 detail, see § § 22.2 § § 22.2 §§22.2\S \S 22.2§§22.2 and 22.3.) In order that the star be static, each element of fluid must remain always at rest in the static coordinate system; i.e., each element must move along a world line of constant r , θ , ϕ r , θ , ϕ r,theta,phir, \theta, \phir,θ,ϕ; i.e., each element must have 4 -velocity components
(23.13a) u r = d r / d τ = 0 , u θ = d θ / d τ = 0 , u ϕ = d ϕ / d τ = 0 (23.13a) u r = d r / d τ = 0 , u θ = d θ / d τ = 0 , u ϕ = d ϕ / d τ = 0 {:(23.13a)u^(r)=dr//d tau=0","quadu^(theta)=d theta//d tau=0","quadu^(phi)=d phi//d tau=0:}\begin{equation*} u^{r}=d r / d \tau=0, \quad u^{\theta}=d \theta / d \tau=0, \quad u^{\phi}=d \phi / d \tau=0 \tag{23.13a} \end{equation*}(23.13a)ur=dr/dτ=0,uθ=dθ/dτ=0,uϕ=dϕ/dτ=0
Material inside star to be
idealized as perfect fluid
Parameters describing perfect fluid:
(1) ρ , p , n ρ , p , n rho,p,n\rho, p, nρ,p,n
(2) u u u\boldsymbol{u}u

(1)

Other coordinates are possible, but Schwarzschild are particularly simple
The normalization of 4 -velocity,
1 = u u = g μ ν u μ u ν = g t t u t u t = e 2 Φ u t u t 1 = u u = g μ ν u μ u ν = g t t u t u t = e 2 Φ u t u t -1=u*u=g_(mu nu)u^(mu)u^(nu)=g_(tt)u^(t)u^(t)=-e^(2Phi)u^(t)u^(t)-1=\boldsymbol{u} \cdot \boldsymbol{u}=g_{\mu \nu} u^{\mu} u^{\nu}=g_{t t} u^{t} u^{t}=-e^{2 \Phi} u^{t} u^{t}1=uu=gμνuμuν=gttutut=e2Φutut
then determines u t u t u^(t)u^{t}ut,
(23.13b) u t = d t / d τ = e Φ , u = e Φ / t (23.13b) u t = d t / d τ = e Φ , u = e Φ / t {:(23.13b)u^(t)=dt//d tau=e^(-Phi)","quad u=e^(-Phi)del//del t:}\begin{equation*} u^{t}=d t / d \tau=e^{-\Phi}, \quad \boldsymbol{u}=e^{-\Phi} \partial / \partial t \tag{23.13b} \end{equation*}(23.13b)ut=dt/dτ=eΦ,u=eΦ/t
and this, together with the general form (23.12) of the stress-energy tensor and the form (23.7) of the metric, determines T μ ν T μ ν T^(mu nu)T^{\mu \nu}Tμν :
(23.14) T 00 = ρ e 2 ϕ , T r r = p e 2 Λ , T θ θ = p r 2 , T ϕ ϕ = p r 2 sin 2 θ , T α β = 0 if α β . (23.14) T 00 = ρ e 2 ϕ , T r r = p e 2 Λ , T θ θ = p r 2 , T ϕ ϕ = p r 2 sin 2 θ , T α β = 0  if  α β . {:[(23.14)T^(00)=rhoe^(-2phi)","quadT^(rr)=pe^(-2Lambda)","quadT^(theta theta)=pr^(-2)","quadT^(phi phi)=pr^(-2)sin^(-2)theta","],[T^(alpha beta)=0" if "alpha!=beta.]:}\begin{gather*} T^{00}=\rho e^{-2 \phi}, \quad T^{r r}=p e^{-2 \Lambda}, \quad T^{\theta \theta}=p r^{-2}, \quad T^{\phi \phi}=p r^{-2} \sin ^{-2} \theta, \tag{23.14}\\ T^{\alpha \beta}=0 \text { if } \alpha \neq \beta . \end{gather*}(23.14)T00=ρe2ϕ,Trr=pe2Λ,Tθθ=pr2,Tϕϕ=pr2sin2θ,Tαβ=0 if αβ.
Although these components of the stress-energy tensor in Schwarzschild coordinates are useful for calculations, the normalization factors e 2 Φ , e 2 Λ , r 2 , r 2 sin 2 θ e 2 Φ , e 2 Λ , r 2 , r 2 sin 2 θ e^(-2Phi),e^(-2Lambda),r^(-2),r^(-2)sin^(-2)thetae^{-2 \Phi}, e^{-2 \Lambda}, r^{-2}, r^{-2} \sin ^{-2} \thetae2Φ,e2Λ,r2,r2sin2θ make them inconvenient for physical interpretations. More convenient are components on orthonormal tetrads carried by the fluid elements ("proper reference frames"; see §13.6):
Proper reference frame of fluid
Components of u u u\boldsymbol{u}u and T T T\boldsymbol{T}T in proper reference frame
Equation of state:
(1) in general
(2) idealized to "one-parameter form" p = p ( n ) , ρ = ρ ( n ) p = p ( n ) , ρ = ρ ( n ) p=p(n),rho=rho(n)p=p(n), \rho=\rho(n)p=p(n),ρ=ρ(n)
(23.15d) e t ^ d d τ = 1 e ϕ t , e r ^ = 1 e A r , e θ ^ = 1 r θ , e ϕ ^ = 1 r sin θ ϕ ; ω t ^ = e ϕ d t , w r ^ = e Λ d r , w θ ^ = r d θ , ω ϕ ^ = r sin θ d ϕ u = e t ^ ; u t ^ = 1 , u r ^ = u θ ^ = u ϕ ^ = 0 ; T i t ^ t ^ T 0 ^ 0 ^ = ρ , T r ^ r ^ = T θ ^ θ ^ = T ϕ ^ ϕ ^ = p , T α ^ β ^ = 0 if α β . (23.15d) e t ^ d d τ = 1 e ϕ t , e r ^ = 1 e A r , e θ ^ = 1 r θ , e ϕ ^ = 1 r sin θ ϕ ; ω t ^ = e ϕ d t , w r ^ = e Λ d r , w θ ^ = r d θ , ω ϕ ^ = r sin θ d ϕ u = e t ^ ; u t ^ = 1 , u r ^ = u θ ^ = u ϕ ^ = 0 ; T i t ^ t ^ T 0 ^ 0 ^ = ρ , T r ^ r ^ = T θ ^ θ ^ = T ϕ ^ ϕ ^ = p , T α ^ β ^ = 0  if  α β . {:(23.15d){:[e_( hat(t))-=(d)/(d tau)=(1)/(e^(phi))(del)/(del t)","quade_( hat(r))=(1)/(e^(A))(del)/(del r)","quade_( hat(theta))=(1)/(r)(del)/(del theta)","quade_( hat(phi))=(1)/(r sin theta)(del)/(del phi);],[omega^( hat(t))=e^(phi)dt","quadw^( hat(r))=e^(Lambda)dr","quadw^( hat(theta))=rd theta","quadomega^( hat(phi))=r sin theta d phi],[u=e_( hat(t));quadu^( hat(t))=1","quadu^( hat(r))=u^( hat(theta))=u^( hat(phi))=0;],[T_( hat(it) hat(t))-=T_( hat(0) hat(0))=rho","quadT_( hat(r) hat(r))=T_( hat(theta) hat(theta))=T_( hat(phi) hat(phi))=p","quadT_( hat(alpha) hat(beta))=0" if "alpha!=beta.]:}:}\begin{array}{r} \boldsymbol{e}_{\hat{t}} \equiv \frac{d}{d \tau}=\frac{1}{e^{\boldsymbol{\phi}}} \frac{\partial}{\partial t}, \quad \boldsymbol{e}_{\hat{r}}=\frac{1}{e^{A}} \frac{\partial}{\partial r}, \quad \boldsymbol{e}_{\hat{\theta}}=\frac{1}{r} \frac{\partial}{\partial \theta}, \quad \boldsymbol{e}_{\hat{\phi}}=\frac{1}{r \sin \theta} \frac{\partial}{\partial \phi} ; \\ \boldsymbol{\omega}^{\hat{t}}=e^{\phi} \boldsymbol{d} t, \quad \boldsymbol{w}^{\hat{r}}=e^{\Lambda} \boldsymbol{d} r, \quad \boldsymbol{w}^{\hat{\theta}}=r \boldsymbol{d} \theta, \quad \boldsymbol{\omega}^{\hat{\phi}}=r \sin \theta \boldsymbol{d} \phi \\ \boldsymbol{u}=\boldsymbol{e}_{\hat{t}} ; \quad u^{\hat{t}}=1, \quad u^{\hat{r}}=u^{\hat{\theta}}=u^{\hat{\phi}}=0 ; \\ T_{\hat{i t} \hat{t}} \equiv T_{\hat{0} \hat{0}}=\rho, \quad T_{\hat{r} \hat{r}}=T_{\hat{\theta} \hat{\theta}}=T_{\hat{\phi} \hat{\phi}}=p, \quad T_{\hat{\alpha} \hat{\beta}}=0 \text { if } \alpha \neq \beta . \tag{23.15d} \end{array}(23.15d)et^ddτ=1eϕt,er^=1eAr,eθ^=1rθ,eϕ^=1rsinθϕ;ωt^=eϕdt,wr^=eΛdr,wθ^=rdθ,ωϕ^=rsinθdϕu=et^;ut^=1,ur^=uθ^=uϕ^=0;Tit^t^T0^0^=ρ,Tr^r^=Tθ^θ^=Tϕ^ϕ^=p,Tα^β^=0 if αβ.
See exercise 23.2 below.
The structure of a star-i.e., the set of functions Φ ( r ) , Λ ( r ) , ρ ( r ) , p ( r ) , n ( r ) Φ ( r ) , Λ ( r ) , ρ ( r ) , p ( r ) , n ( r ) Phi(r),Lambda(r),rho(r),p(r),n(r)\Phi(r), \Lambda(r), \rho(r), p(r), n(r)Φ(r),Λ(r),ρ(r),p(r),n(r)-is determined in part by the Einstein field equations, G μ ν = 8 π T μ ν G μ ν = 8 π T μ ν G^(mu nu)=8piT^(mu nu)G^{\mu \nu}=8 \pi T^{\mu \nu}Gμν=8πTμν, and in part by the law of local conservation of energy-momentum in the fluid, T μ ν ; ν = 0 T μ ν ; ν = 0 T^(mu nu)_(;nu)=0T^{\mu \nu}{ }_{; \nu}=0Tμν;ν=0. However, these are not sufficient to fix the structure uniquely. Also necessary is the functional dependence of pressure p p ppp and density ρ ρ rho\rhoρ on number density of baryons n n nnn :
(23.16) p = p ( n ) , ρ = ρ ( n ) (23.16) p = p ( n ) , ρ = ρ ( n ) {:(23.16)p=p(n)","quad rho=rho(n):}\begin{equation*} p=p(n), \quad \rho=\rho(n) \tag{23.16} \end{equation*}(23.16)p=p(n),ρ=ρ(n)
Normally one cannot deduce p p ppp and ρ ρ rho\rhoρ from a knowledge solely of n n nnn. One must know, in addition, the temperature T T TTT or the entropy per baryon s s sss; then the laws of thermodynamics plus equations of state will determine all remaining thermodynamic variables:
p = p ( n , s ) , ρ = ρ ( n , s ) , p = p ( n , s ) , ρ = ρ ( n , s ) , p=p(n,s),quad rho=rho(n,s),dotsp=p(n, s), \quad \rho=\rho(n, s), \ldotsp=p(n,s),ρ=ρ(n,s),
(See § 22.2 § 22.2 §22.2\S 22.2§22.2 and Box 22.1 for full Track-2 discussions.) To pass from the given thermodynamic knowledge, p ( n , s ) p ( n , s ) p(n,s)p(n, s)p(n,s) and ρ ( n , s ) ρ ( n , s ) rho(n,s)\rho(n, s)ρ(n,s), to the desired knowledge, p ( n ) p ( n ) p(n)p(n)p(n) and ρ ( n ) ρ ( n ) rho(n)\rho(n)ρ(n), one needs information about the star's thermal properties, and especially about the way in which energy generation plus heat flow have conspired to distribute the entropy, s = s ( n ) s = s ( n ) s=s(n)s=s(n)s=s(n) :
p ( n ) = p [ n , s ( n ) ] , ρ ( n ) = ρ [ n , s ( n ) ] . p ( n ) = p [ n , s ( n ) ] , ρ ( n ) = ρ [ n , s ( n ) ] . p(n)=p[n,s(n)],quad rho(n)=rho[n,s(n)].p(n)=p[n, s(n)], \quad \rho(n)=\rho[n, s(n)] .p(n)=p[n,s(n)],ρ(n)=ρ[n,s(n)].
There exist three important applications of the theory of relativistic stars: neutron stars, white dwarfs, and supermassive stars (stars with M 10 3 M M 10 3 M M >= 10^(3)M_(o.)M \geq 10^{3} M_{\odot}M103M, which may exist according to theory, but the existence of which has never yet been confirmed by observation). In all three cases, happily, the passage from p = p ( n , s ) , ρ ( n , s ) p = p ( n , s ) , ρ ( n , s ) p=p(n,s),rho(n,s)p=p(n, s), \rho(n, s)p=p(n,s),ρ(n,s), to p = p ( n ) , ρ = ρ ( n ) p = p ( n ) , ρ = ρ ( n ) p=p(n),rho=rho(n)p=p(n), \rho=\rho(n)p=p(n),ρ=ρ(n), is trivial.
Consider first a neutron star. Though hot by ordinary standards, a neutron star is so cold by any nuclear-matter scale of temperatures that essentially all its thermal degrees of freedom are frozen out ("degenerate gas"; "quantum fluid"). It is not important that a detailed treatment of the substance of a neutron star is beyond the capability of present theory (allowance for the interaction between baryon and baryon; production at sufficiently high pressures of hyperons and mesons). The simple fact is that one is dealing with matter at densities comparable to the density of matter in an atomic nucleus ( 2 × 10 14 g / cm 3 ) 2 × 10 14 g / cm 3 (2xx10^(14)(g)//cm^(3))\left(2 \times 10^{14} \mathrm{~g} / \mathrm{cm}^{3}\right)(2×1014 g/cm3) and higher. Everything one knows about nuclear matter [see, for example, Bohr and Mottelson (1969)] tells one that it is degenerate, and that one can estimate in order of magnitude its degeneracy temperature by treating it as though it were an ideal Fermi neutron gas. (In a normal atomic nucleus, a little more than 50 per cent of all baryons are neutrons, the rest are protons; in a neutron star, as many as 99 per cent are neutrons.) When approximating the neutron-star matter as an ideal Fermi neutron gas, one considers the neutrons to occupy free-particle quantum states, with two particles of opposite spin in each occupied state, and a sharp drop from 100 per cent occupancy of quantum states to empty states when the particle energy rises to the level of the "Fermi energy" [for more on such an ideal Fermi gas, see Kittel, Section 19 (1958); or at an introductory level, see Sears, Section 16-5 (1953)]. In matter at nuclear density, the Fermi energy is of the order
E Fermi 30 MeV or 3 × 10 11 K E Fermi  30 MeV  or  3 × 10 11 K E_("Fermi ")∼30MeV" or "3xx10^(11)KE_{\text {Fermi }} \sim 30 \mathrm{MeV} \text { or } 3 \times 10^{11} \mathrm{~K}EFermi 30MeV or 3×1011 K
and at higher density the temperature required to unfreeze the degeneracy is even greater. In other words, for matter at and above nuclear densities, already at zero temperature the kinetic energy of the particles (governed by the Pauli exclusion principle and by their Fermi energy) is a primary source of pressure. Nuclear forces make a large correction to this pressure, but for T 30 MeV = 3 × 10 11 K T 30 MeV = 3 × 10 11 K T <= 30MeV=3xx10^(11)KT \leqq 30 \mathrm{MeV}=3 \times 10^{11} \mathrm{~K}T30MeV=3×1011 K, energies of thermal agitation do not.
A star, in collapsing from a normal state to a neutron-star state (see Chapter 24), emits a huge flux of neutrinos at temperatures 10 10 K 10 10 K >= 10^(10)K\geq 10^{10} \mathrm{~K}1010 K, and thereby cools to T 3 × 10 11 K T 3 × 10 11 K T≪3xx10^(11)KT \ll 3 \times 10^{11} \mathrm{~K}T3×1011 K within a few seconds after formation. Consequently, in all neutron stars older than a few seconds one can neglect thermal contributions to the pressure and density; i.e., one can set
p ( n , s ) = p ( n , s = 0 ) = p ( n ) , ρ ( n , s ) = ρ ( n , s = 0 ) = ρ ( n ) p ( n , s ) = p ( n , s = 0 ) = p ( n ) , ρ ( n , s ) = ρ ( n , s = 0 ) = ρ ( n ) p(n,s)=p(n,s=0)=p(n),quad rho(n,s)=rho(n,s=0)=rho(n)p(n, s)=p(n, s=0)=p(n), \quad \rho(n, s)=\rho(n, s=0)=\rho(n)p(n,s)=p(n,s=0)=p(n),ρ(n,s)=ρ(n,s=0)=ρ(n)
A white dwarf is similar, except that here electrons rather than neutrons are the source of Fermi gas pressure and degeneracy. Typical white-dwarf temperatures satisfy
k T E Fermi electrons ; k T E Fermi electrons  ; kT≪E_("Fermi electrons ");k T \ll E_{\text {Fermi electrons }} ;kTEFermi electrons ;
Justification for idealized equation of state:
(1) in neutron stars
(2) in white dwarfs
(3) in supermassive stars
the Fermi kinetic energy (Pauli exclusion principle), and not random k T k T kTk TkT energy, is primarily responsible for the pressure and energy density; and one can set
p ( n , s ) = p ( n , s = 0 ) = p ( n ) , ρ ( n , s ) = ρ ( n , s = 0 ) = ρ ( n ) . p ( n , s ) = p ( n , s = 0 ) = p ( n ) , ρ ( n , s ) = ρ ( n , s = 0 ) = ρ ( n ) . p(n,s)=p(n,s=0)=p(n),quad rho(n,s)=rho(n,s=0)=rho(n).p(n, s)=p(n, s=0)=p(n), \quad \rho(n, s)=\rho(n, s=0)=\rho(n) .p(n,s)=p(n,s=0)=p(n),ρ(n,s)=ρ(n,s=0)=ρ(n).
In a supermassive star (see Chapter 24), the situation is quite different. There temperature and entropy are almost the whole story, so far as pressure and energy density are concerned. However, convection keeps the star stirred up and produces a uniform entropy distribution
s = const. independent of radius; s =  const. independent of radius;  s=" const. independent of radius; "s=\text { const. independent of radius; }s= const. independent of radius; 
so one can write
p ( n , s ) = p s ( n ) , ρ ( n , s ) = ρ s ( n ) . [ functions depending on uniform entropy per baryon, s , in the star ] p ( n , s ) = p s ( n ) , ρ ( n , s ) = ρ s ( n ) .  functions depending on   uniform entropy per baryon,  s ,  in the star  {:[p(n","s)=p_(s)(n)","quad rho(n","s)=rho_(s)(n).],[qquad],[[[" functions depending on "],[" uniform entropy per baryon, "],[s","" in the star "]]uarr]:}\begin{array}{r} p(n, s)=p_{s}(n), \quad \rho(n, s)=\rho_{s}(n) . \\ \qquad \\ {\left[\begin{array}{l} \text { functions depending on } \\ \text { uniform entropy per baryon, } \\ s, \text { in the star } \end{array}\right] \uparrow} \end{array}p(n,s)=ps(n),ρ(n,s)=ρs(n).[ functions depending on  uniform entropy per baryon, s, in the star ]
In all three cases-neutron stars, white dwarfs, supermassive stars-one regards the relations p ( n ) p ( n ) p(n)p(n)p(n) and ρ ( n ) ρ ( n ) rho(n)\rho(n)ρ(n) as "equations of state"; and having specified them, one can calculate the star's structure without further reference to its thermal properties.

EXERCISE

Exercise 23.2. PROPER REFERENCE FRAMES OF FLUID ELEMENTS

(a) Verify that equations ( 23.15 a , b ) ( 23.15 a , b ) (23.15a,b)(23.15 \mathrm{a}, \mathrm{b})(23.15a,b) define an orthonormal tetrad and its dual basis of 1 -forms, at each event in spacetime.
(b) Verify that the components of the fluid 4 -velocity relative to these tetrads are given by equations ( 23.15 c ). Why do these components guarantee that the tetrads form "proper reference frames" for the fluid elements?
(c) Verify equations ( 23.15 d ) ( 23.15 d ) (23.15d)(23.15 \mathrm{~d})(23.15 d) for the components of the stress-energy tensor.

§23.5. EQUATIONS OF STRUCTURE

Five equations needed to determine 5 stellar-structure functions: Φ , Λ , p , ρ , n Φ , Λ , p , ρ , n Phi,Lambda,p,rho,n\Phi, \Lambda, p, \rho, nΦ,Λ,p,ρ,n
The structure of a relativistic star is determined by five functions of radius r r rrr : the metric functions Φ ( r ) , Λ ( r ) Φ ( r ) , Λ ( r ) Phi(r),Lambda(r)\Phi(r), \Lambda(r)Φ(r),Λ(r), the pressure p ( r ) p ( r ) p(r)p(r)p(r), the density of mass-energy ρ ( r ) ρ ( r ) rho(r)\rho(r)ρ(r), and the number density of baryons, n ( r ) n ( r ) n(r)n(r)n(r). Hence, to determine the structure uniquely, one needs five equations of structure, plus boundary conditions. Two equations of structure, the equations of state p ( n ) p ( n ) p(n)p(n)p(n) and ρ ( n ) ρ ( n ) rho(n)\rho(n)ρ(n), are already in hand. The remaining three must be the essential content of the Einstein field equations and of the law of local energy-momentum conservation, T μ ν ; ν = 0 T μ ν ; ν = 0 T^(mu nu)_(;nu)=0T^{\mu \nu}{ }_{; \nu}=0Tμν;ν=0.
One knows that the law of local energy-momentum conservation for the fluid follows as an identity from the Einstein field equations. Without loss of information,
one can therefore impose all ten field equations and ignore local energy-momentum conservation. But that is an inefficient way to proceed. Almost always the equations T μ ν ; ν = 0 T μ ν ; ν = 0 T^(mu nu)_(;nu)=0T^{\mu \nu}{ }_{; \nu}=0Tμν;ν=0 can be reduced to usable form more easily than can the field equations. Hence, the most efficient procedure is to: (1) evaluate the four equations T μ ν ; ν = 0 T μ ν ; ν = 0 T^(mu nu)_(;nu)=0T^{\mu \nu}{ }_{; \nu}=0Tμν;ν=0; (2) evaluate enough field equations (six) to obtain a complete set ( 6 + 4 = 10 ) ( 6 + 4 = 10 ) (6+4=10)(6+4=10)(6+4=10); and (3) evaluate the remaining four field equations as checks of the results of (1) and (2).
The Track-2 reader has learned ( $ 22.3 $ 22.3 $22.3\$ 22.3$22.3 ) that the equations T μ ν ; ν = 0 T μ ν ; ν = 0 T^(mu nu)_(;nu)=0T^{\mu \nu}{ }_{; \nu}=0Tμν;ν=0 for a perfect fluid take on an especially simple form when projected (1) on the 4 -velocity u u u\boldsymbol{u}u of the fluid itself, and (2) orthogonal to u u u\boldsymbol{u}u. Projection along u ( u μ T μ ν ; ν = 0 ) u u μ T μ ν ; ν = 0 u(u_(mu)T^(mu nu)_(;nu)=0)\boldsymbol{u}\left(u_{\mu} T^{\mu \nu}{ }_{; \nu}=0\right)u(uμTμν;ν=0) gives the local law of energy conservation (22.11a),
d ρ d τ = ( ρ + p ) u = ρ + p n d n d τ d ρ d τ = ( ρ + p ) u = ρ + p n d n d τ (d rho)/(d tau)=-(rho+p)grad*u=(rho+p)/(n)(dn)/(d tau)\frac{d \rho}{d \tau}=-(\rho+p) \nabla \cdot \boldsymbol{u}=\frac{\rho+p}{n} \frac{d n}{d \tau}dρdτ=(ρ+p)u=ρ+pndndτ
where u = d / d τ u = d / d τ u=d//d tau\boldsymbol{u}=d / d \tauu=d/dτ; i.e., τ τ tau\tauτ is proper time along the world line of any chosen element of the fluid. For a static star, or for any other static system, both sides of this equation must vanish identically (no fluid element ever sees any change in its own density).
Projection of T μ ν ; ν = 0 T μ ν ; ν = 0 T^(mu nu)_(;nu)=0T^{\mu \nu}{ }_{; \nu}=0Tμν;ν=0 orthogonal to u u u\boldsymbol{u}u gives the reasonable equation
( inertial mass per unit volume ) × ( 4 -acceleration ) = ( pressure gradient, projected perpendicular to u ) (  inertial mass   per unit volume  ) × ( 4 -acceleration  ) = (  pressure gradient, projected   perpendicular to  u ) ((" inertial mass ")/(" per unit volume "))xx(4"-acceleration ")=-((" pressure gradient, projected ")/(" perpendicular to "u))\binom{\text { inertial mass }}{\text { per unit volume }} \times(4 \text {-acceleration })=-\binom{\text { pressure gradient, projected }}{\text { perpendicular to } \boldsymbol{u}}( inertial mass  per unit volume )×(4-acceleration )=( pressure gradient, projected  perpendicular to u)
i.e.,
( ρ + p ) u u = [ p + ( u p ) u ] . ( ρ + p ) u u = p + u p u . (rho+p)grad_(u)u=-[grad p+(grad_(u)p)u].(\rho+p) \boldsymbol{\nabla}_{\boldsymbol{u}} \boldsymbol{u}=-\left[\boldsymbol{\nabla} p+\left(\boldsymbol{\nabla}_{\boldsymbol{u}} p\right) \boldsymbol{u}\right] .(ρ+p)uu=[p+(up)u].
[see equation (22.13)]. When applied to a static star, this equation tells how much pressure gradient is needed to prevent a fluid element from falling. Only the radial component of this equation has content, since the pressure depends only on r r rrr. The radial component in the Schwarzschild coordinate system says [see the line element (23.7) and the 4 -velocity components (23.13)],
( ρ + p ) u r ; p u ν = ( ρ + p ) Γ r ν α u α u ν = ( ρ + p ) Γ r 0 0 u 0 u 0 (23.17) = ( ρ + p ) Φ , r = p , r ( ρ + p ) u r ; p u ν = ( ρ + p ) Γ r ν α u α u ν = ( ρ + p ) Γ r 0 0 u 0 u 0 (23.17) = ( ρ + p ) Φ , r = p , r {:[(rho+p)u_(r;p)u^(nu)=-(rho+p)Gamma_(r nu)^(alpha)u_(alpha)u^(nu)=-(rho+p)Gamma_(r0)^(0)u_(0)u^(0)],[(23.17)=(rho+p)Phi_(,r)=-p_(,r)]:}\begin{align*} (\rho+p) u_{r ; p} u^{\nu} & =-(\rho+p) \Gamma_{r \nu}^{\alpha} u_{\alpha} u^{\nu}=-(\rho+p) \Gamma_{r 0}^{0} u_{0} u^{0} \\ & =(\rho+p) \Phi_{, r}=-p_{, r} \tag{23.17} \end{align*}(ρ+p)ur;puν=(ρ+p)Γrναuαuν=(ρ+p)Γr00u0u0(23.17)=(ρ+p)Φ,r=p,r
(Track-1 readers can derive this from scratch at the end of the section, exercise 23.3.) In the Newtonian limit, Φ Φ Phi\PhiΦ becomes the Newtonian potential (since g 00 = g 00 = g_(00)=g_{00}=g00= e 2 Φ 1 2 Φ ) e 2 Φ 1 2 Φ {:-e^(2Phi)~~-1-2Phi)\left.-e^{2 \Phi} \approx-1-2 \Phi\right)e2Φ12Φ), and the pressure becomes much smaller than the mass-energy density; consequently equation (23.17) becomes
(23.17~N) ρ Φ , r = p , r (23.17~N) ρ Φ , r = p , r {:(23.17~N)rhoPhi_(,r)=-p_(,r):}\begin{equation*} \rho \Phi_{, r}=-p_{, r} \tag{23.17~N} \end{equation*}(23.17~N)ρΦ,r=p,r
This is the Newtonian version of the equation describing the balance between gravitational force and pressure gradient.
The pressure gradient that prevents a fluid element from falling appears in Einstein's theory as the source of an acceleration. This acceleration, by keeping the fluid element at a fixed r r rrr value, causes it to depart from geodesic motion (from "fiducial world line"; from motion of free fall into the center of the star). Newtonian
The most efficient procedure
Equation of hydrostatic equilibrium derived
for solving Einstein equations
Comparison of Newton and Einstein views of hydrostatic equilibrium

Equation for Λ Λ Lambda\LambdaΛ derived

"Mass-energy inside radius
r r rrr." m ( r ) m ( r ) m(r)m(r)m(r) defined r , m ( r ) r , m ( r ) r,^('')m(r)r,{ }^{\prime \prime} m(r)r,m(r), defined
theory, on the other hand, views as the fiducial world line the one that stays at a fixed r r rrr value. It regards the "gravitational force" as trying (without success, because balanced by the pressure gradient) to pull a particle from a fixed-r world line onto a geodesic world line. In the two theories the magnitudes of the acceleration, whether "actually taking place" (Einstein theory) or "trying to take place" (Newtonian theory), are the same to lowest order (but opposite in direction); so it is no surprise that ( 23.17 ) ( 23.17 ) (23.17)(23.17)(23.17) and ( 23.17 N ) ( 23.17 N ) (23.17N)(23.17 \mathrm{~N})(23.17 N) differ only in detail.
Turn next to the Einstein field equation. Here, as is often the case, the components of the field equation in the fluid's orthonormal frame [equations (23.15a,b)] are simpler than the components in the coordinate basis. One already knows the stressenergy tensor T α ^ β ^ T α ^ β ^ T_( hat(alpha) hat(beta))T_{\hat{\alpha} \hat{\beta}}Tα^β^ in the orthonormal frame [equation (23.15d)]; and Track-2 readers have already calculated the Einstein tensor G α ^ β ^ G α ^ β ^ G_( hat(alpha) hat(beta))G_{\hat{\alpha} \hat{\beta}}Gα^β^ (exercise 14.13 ; Track-1 readers will face the task at the end of this section, exercise 23.4). All that remains is to equate G α ^ β ^ G α ^ β ^ G_( hat(alpha) hat(beta))G_{\hat{\alpha} \hat{\beta}}Gα^β^ to 8 π T α ^ β ^ 8 π T α ^ β ^ 8piT_( hat(alpha) hat(beta))8 \pi T_{\hat{\alpha} \hat{\beta}}8πTα^β^. Examine first the 0 ^ 0 ^ 0 ^ 0 ^ hat(0) hat(0)\hat{0} \hat{0}0^0^ component of the field equations:
G 0 ^ 0 ^ = r 2 r 2 e 2 Λ r 1 ( d / d r ) ( e 2 Λ ) = r 2 ( d / d r ) [ r ( 1 e 2 Λ ) ] = 8 π T 0 ^ 0 ^ = 8 π ρ G 0 ^ 0 ^ = r 2 r 2 e 2 Λ r 1 ( d / d r ) e 2 Λ = r 2 ( d / d r ) r 1 e 2 Λ = 8 π T 0 ^ 0 ^ = 8 π ρ {:[G_( hat(0) hat(0))=r^(-2)-r^(-2)e^(-2Lambda)-r^(-1)(d//dr)(e^(-2Lambda))],[=r^(-2)(d//dr)[r(1-e^(-2Lambda))]=8piT_( hat(0) hat(0))=8pi rho]:}\begin{aligned} G_{\hat{0} \hat{0}} & =r^{-2}-r^{-2} e^{-2 \Lambda}-r^{-1}(d / d r)\left(e^{-2 \Lambda}\right) \\ & =r^{-2}(d / d r)\left[r\left(1-e^{-2 \Lambda}\right)\right]=8 \pi T_{\hat{0} \hat{0}}=8 \pi \rho \end{aligned}G0^0^=r2r2e2Λr1(d/dr)(e2Λ)=r2(d/dr)[r(1e2Λ)]=8πT0^0^=8πρ
This equation becomes easy to solve as soon as one notices that it is a differential equation linear in the quantity e 2 A e 2 A e^(-2A)e^{-2 A}e2A; a bit of tidying up then focuses attention on the quantity r ( 1 e 2 Λ ) r 1 e 2 Λ r(1-e^(-2Lambda))r\left(1-e^{-2 \Lambda}\right)r(1e2Λ). Give this quantity the name 2 m ( r ) 2 m ( r ) 2m(r)2 m(r)2m(r) (so far only a name!); thus,
(23.18) 2 m r ( 1 e 2 Λ ) ; e 2 Λ = ( 1 2 m / r ) 1 (23.18) 2 m r 1 e 2 Λ ; e 2 Λ = ( 1 2 m / r ) 1 {:(23.18)2m-=r(1-e^(-2Lambda));quade^(2Lambda)=(1-2m//r)^(-1):}\begin{equation*} 2 m \equiv r\left(1-e^{-2 \Lambda}\right) ; \quad e^{2 \Lambda}=(1-2 m / r)^{-1} \tag{23.18} \end{equation*}(23.18)2mr(1e2Λ);e2Λ=(12m/r)1
In this notation the 0 ^ 0 ^ 0 ^ 0 ^ hat(0) hat(0)\hat{0} \hat{0}0^0^ component of the Einstein tensor becomes
G 0 ^ 0 ^ = 2 r 2 d m ( r ) d r = 8 π ρ G 0 ^ 0 ^ = 2 r 2 d m ( r ) d r = 8 π ρ G_( hat(0) hat(0))=(2)/(r^(2))(dm(r))/(dr)=8pi rhoG_{\hat{0} \hat{0}}=\frac{2}{r^{2}} \frac{d m(r)}{d r}=8 \pi \rhoG0^0^=2r2dm(r)dr=8πρ
Integrate and find
(23.19) m ( r ) = 0 r 4 π r 2 ρ d r + m ( 0 ) (23.19) m ( r ) = 0 r 4 π r 2 ρ d r + m ( 0 ) {:(23.19)m(r)=int_(0)^(r)4pir^(2)rho dr+m(0):}\begin{equation*} m(r)=\int_{0}^{r} 4 \pi r^{2} \rho d r+m(0) \tag{23.19} \end{equation*}(23.19)m(r)=0r4πr2ρdr+m(0)
For the constant of integration m ( 0 ) m ( 0 ) m(0)m(0)m(0), a zero value means a space geometry smooth at the origin (physically acceptable); a non-zero value means a geometry with a singularity at the origin (physically unacceptable: no local Lorentz frame at r = 0 r = 0 r=0r=0r=0 ):
d s 2 = [ 1 2 m ( 0 ) / r ] 1 d r 2 + r 2 ( d θ 2 + sin 2 θ d ϕ 2 ) [ r / 2 m ( 0 ) ] d r 2 + r 2 ( d θ 2 + sin 2 θ d ϕ 2 ) at r 0 if m ( 0 ) 0 (23.20) d s 2 = [ 1 ( 8 π / 3 ) ρ c r 2 ] 1 d r 2 + r 2 ( d θ 2 + sin 2 θ d ϕ 2 ) d r 2 + r 2 ( d θ 2 + sin 2 θ d ϕ 2 ) at r 0 if m ( 0 ) = 0 d s 2 = [ 1 2 m ( 0 ) / r ] 1 d r 2 + r 2 d θ 2 + sin 2 θ d ϕ 2 [ r / 2 m ( 0 ) ] d r 2 + r 2 d θ 2 + sin 2 θ d ϕ 2  at  r 0  if  m ( 0 ) 0 (23.20) d s 2 = 1 ( 8 π / 3 ) ρ c r 2 1 d r 2 + r 2 d θ 2 + sin 2 θ d ϕ 2 d r 2 + r 2 d θ 2 + sin 2 θ d ϕ 2  at  r 0  if  m ( 0 ) = 0 {:[ds^(2)=[1-2m(0)//r]^(-1)dr^(2)+r^(2)(dtheta^(2)+sin^(2)theta dphi^(2))],[~~-[r//2m(0)]dr^(2)+r^(2)(dtheta^(2)+sin^(2)theta dphi^(2))quad" at "r~~0" if "m(0)!=0],[(23.20)ds^(2)=[1-(8pi//3)rho_(c)r^(2)]^(-1)dr^(2)+r^(2)(dtheta^(2)+sin^(2)theta dphi^(2))],[~~dr^(2)+r^(2)(dtheta^(2)+sin^(2)theta dphi^(2))quad" at "r~~0" if "m(0)=0]:}\begin{align*} d s^{2} & =[1-2 m(0) / r]^{-1} d r^{2}+r^{2}\left(d \theta^{2}+\sin ^{2} \theta d \phi^{2}\right) \\ & \approx-[r / 2 m(0)] d r^{2}+r^{2}\left(d \theta^{2}+\sin ^{2} \theta d \phi^{2}\right) \quad \text { at } r \approx 0 \text { if } m(0) \neq 0 \\ d s^{2} & =\left[1-(8 \pi / 3) \rho_{c} r^{2}\right]^{-1} d r^{2}+r^{2}\left(d \theta^{2}+\sin ^{2} \theta d \phi^{2}\right) \tag{23.20}\\ & \approx d r^{2}+r^{2}\left(d \theta^{2}+\sin ^{2} \theta d \phi^{2}\right) \quad \text { at } r \approx 0 \text { if } m(0)=0 \end{align*}ds2=[12m(0)/r]1dr2+r2(dθ2+sin2θdϕ2)[r/2m(0)]dr2+r2(dθ2+sin2θdϕ2) at r0 if m(0)0(23.20)ds2=[1(8π/3)ρcr2]1dr2+r2(dθ2+sin2θdϕ2)dr2+r2(dθ2+sin2θdϕ2) at r0 if m(0)=0
The quantity m ( r ) m ( r ) m(r)m(r)m(r), defined by equation (23.18) and calculated from equation (23.19) with m ( 0 ) = 0 m ( 0 ) = 0 m(0)=0m(0)=0m(0)=0, is a relativistic analog of the "mass-energy inside radius r r rrr." Box 23.1 spells out the analogy in detail.

Box 23.1 MASS-ENERGY INSIDE RADIUS r r rrr

The total mass-energy M M MMM of an isolated star is well-defined (Chapter 19). But not well-defined, in general, is the distribution of that mass-energy from point to point inside the star and in its gravitational field (no unique "gravitational stress-energy tensor"). This was the crucial message of §20.4 (Track 2).
The message is true in general. But for the case of a spherical star-and only for that case-the message loses its bite. Spherical symmetry allows one to select a distribution of the total mass-energy that is physically reasonable. In Schwarzschild coordinates, it is defined by
(1) "total mass-energy inside radius r " m ( r ) = 0 r 4 π r 2 ρ d r . (1)  "total mass-energy inside radius  r  "  m ( r ) = 0 r 4 π r 2 ρ d r {:(1)" "total mass-energy inside radius "r" " "-=m(r)=int_(0)^(r)4pir^(2)rho dr". ":}\begin{equation*} \text { "total mass-energy inside radius } r \text { " } \equiv m(r)=\int_{0}^{r} 4 \pi r^{2} \rho d r \text {. } \tag{1} \end{equation*}(1) "total mass-energy inside radius r " m(r)=0r4πr2ρdr
The fully convincing argument for this definition is found only by considering a generalization of it to time-dependent spherically symmetric stars (pulsating, collapsing, or exploding stars; see Chapters 26 and 32 , and especially exercise 32.7). For them one finds that the mass-energy m m mmm associated with a given ball of matter (fixed baryon number) can change in time only to the extent that locally measurable energy fluxes can be detected at the boundary of the ball. [Such energy fluxes could be the power expended by pressure forces against the moving boundary surface, or heat fluxes, or radiation (photon or neutrino) fluxes. But since spherically symmetric gravitational waves do not exist (Chapters 35 and 36), neither physical intuition nor Einstein's equations require that problems of localizing gravitational-wave energy be faced.] Thus the energy m m mmm is localized, not by a mathematical convention, but by the circumstance that transfer of energy (with this definition of m m mmm ) is detectable by local measurements. [For the mathematical details of m ( r , t ) m ( r , t ) m(r,t)m(r, t)m(r,t) in the time-dependent case, see Misner and Sharp (1964), Misner (1965), and exercise 32.7.]
In addition to the critical "local energy flux" property of m ( r ) m ( r ) m(r)m(r)m(r) described above, there are three further properties that verify its identification as mass-energy. They are: (1) Everywhere outside the star
(2) m ( r ) = M ( total mass-energy of star as measured from Kepler's third law for distant planets ) (2) m ( r ) = M (  total mass-energy of star as measured from   Kepler's third law for distant planets  ) {:(2)m(r)=M-=((" total mass-energy of star as measured from ")/(" Kepler's third law for distant planets ")):}\begin{equation*} m(r)=M \equiv\binom{\text { total mass-energy of star as measured from }}{\text { Kepler's third law for distant planets }} \tag{2} \end{equation*}(2)m(r)=M( total mass-energy of star as measured from  Kepler's third law for distant planets )
see § 23.6 § 23.6 §23.6\S 23.6§23.6 for proof. (2) For a Newtonian star, where "mass inside radius r r rrr " has a unique meaning, m ( r ) m ( r ) m(r)m(r)m(r) is that mass. (3) For a relativistic star, m ( r ) m ( r ) m(r)m(r)m(r) splits nicely into "rest mass-energy" m 0 ( r ) m 0 ( r ) m_(0)(r)m_{0}(r)m0(r) plus "internal energy" U ( r ) U ( r ) U(r)U(r)U(r) plus "gravitational potential energy" Ω ( r ) Ω ( r ) Omega(r)\Omega(r)Ω(r).
To recognize and appreciate the split
(3) m ( r ) = m 0 ( r ) + U ( r ) + Ω ( r ) , (3) m ( r ) = m 0 ( r ) + U ( r ) + Ω ( r ) , {:(3)m(r)=m_(0)(r)+U(r)+Omega(r)",":}\begin{equation*} m(r)=m_{0}(r)+U(r)+\Omega(r), \tag{3} \end{equation*}(3)m(r)=m0(r)+U(r)+Ω(r),
proceed as follows. First split the total density of mass-energy, ρ ρ rho\rhoρ, into a part μ 0 n μ 0 n mu_(0)n\mu_{0} nμ0n due to rest mass - where μ 0 μ 0 mu_(0)\mu_{0}μ0 is the average rest mass of the baryonic species pres-

Box 23.1 (continued)

ent-and a part ρ μ 0 n ρ μ 0 n rho-mu_(0)n\rho-\mu_{0} nρμ0n due to internal thermal energy, compressional energy, etc. Next notice that the proper volume of a shell of thickness d r d r drd rdr is
(4) d V = 4 π r 2 ( e A d r ) = 4 π r 2 ( 1 2 m / r ) 1 / 2 d r (4) d V = 4 π r 2 e A d r = 4 π r 2 ( 1 2 m / r ) 1 / 2 d r {:(4)dV=4pir^(2)(e^(A)dr)=4pir^(2)(1-2m//r)^(-1//2)dr:}\begin{equation*} d \mathscr{V}=4 \pi r^{2}\left(e^{A} d r\right)=4 \pi r^{2}(1-2 m / r)^{-1 / 2} d r \tag{4} \end{equation*}(4)dV=4πr2(eAdr)=4πr2(12m/r)1/2dr
not 4 π r 2 d r 4 π r 2 d r 4pir^(2)dr4 \pi r^{2} d r4πr2dr. Consequently, the total rest mass inside radius r r rrr is
(5) m 0 = 0 r μ 0 n d V = 0 r 4 π r 2 ( 1 2 m / r ) 1 / 2 μ 0 n d r (5) m 0 = 0 r μ 0 n d V = 0 r 4 π r 2 ( 1 2 m / r ) 1 / 2 μ 0 n d r {:(5)m_(0)=int_(0)^(r)mu_(0)ndV=int_(0)^(r)4pir^(2)(1-2m//r)^(-1//2)mu_(0)ndr:}\begin{equation*} m_{0}=\int_{0}^{r} \mu_{0} n d \mathscr{V}=\int_{0}^{r} 4 \pi r^{2}(1-2 m / r)^{-1 / 2} \mu_{0} n d r \tag{5} \end{equation*}(5)m0=0rμ0ndV=0r4πr2(12m/r)1/2μ0ndr
and the total internal energy is
(6) U = 0 r ( ρ μ 0 n ) d V = 0 r 4 π r 2 ( 1 2 m / r ) 1 / 2 ( ρ μ 0 n ) d r (6) U = 0 r ρ μ 0 n d V = 0 r 4 π r 2 ( 1 2 m / r ) 1 / 2 ρ μ 0 n d r {:(6)U=int_(0)^(r)(rho-mu_(0)n)dV=int_(0)^(r)4pir^(2)(1-2m//r)^(-1//2)(rho-mu_(0)n)dr:}\begin{equation*} U=\int_{0}^{r}\left(\rho-\mu_{0} n\right) d \mathscr{V}=\int_{0}^{r} 4 \pi r^{2}(1-2 m / r)^{-1 / 2}\left(\rho-\mu_{0} n\right) d r \tag{6} \end{equation*}(6)U=0r(ρμ0n)dV=0r4πr2(12m/r)1/2(ρμ0n)dr
Subtract these from the total mass-energy, m m mmm; the quantity that is left must be the gravitational potential energy,
Ω = 0 r ρ [ ( 1 2 m / r ) 1 / 2 1 ] 4 π r 2 d r (7) 0 r ( ρ m / r ) 4 π r 2 d r [ Newtonian limit, m / r 1 ] Ω = 0 r ρ ( 1 2 m / r ) 1 / 2 1 4 π r 2 d r (7) 0 r ( ρ m / r ) 4 π r 2 d r [  Newtonian limit,  m / r 1 ] {:[Omega=-int_(0)^(r)rho[(1-2m//r)^(-1//2)-1]4pir^(2)dr],[(7)~~-int_(0)^(r)(rho m//r)4pir^(2)dr],[[" Newtonian limit, "m//r≪1]]:}\begin{align*} \Omega & =-\int_{0}^{r} \rho\left[(1-2 m / r)^{-1 / 2}-1\right] 4 \pi r^{2} d r \\ & \approx-\int_{0}^{r}(\rho m / r) 4 \pi r^{2} d r \tag{7}\\ & {[\text { Newtonian limit, } m / r \ll 1] } \end{align*}Ω=0rρ[(12m/r)1/21]4πr2dr(7)0r(ρm/r)4πr2dr[ Newtonian limit, m/r1]
(See exercise 23.7.)
Turn next to the r ^ r ^ r ^ r ^ hat(r) hat(r)\hat{r} \hat{r}r^r^ component of the field equations:
G r ^ r ^ = r 2 + r 2 e 2 Λ + 2 r 1 e 2 Λ d Φ / d r = 8 π T r ^ r ^ = 8 π p . G r ^ r ^ = r 2 + r 2 e 2 Λ + 2 r 1 e 2 Λ d Φ / d r = 8 π T r ^ r ^ = 8 π p . {:[G_( hat(r) hat(r))=-r^(-2)+r^(-2)e^(-2Lambda)+2r^(-1)e^(-2Lambda)d Phi//dr],[=8piT_( hat(r) hat(r))=8pi p.]:}\begin{aligned} G_{\hat{r} \hat{r}} & =-r^{-2}+r^{-2} e^{-2 \Lambda}+2 r^{-1} e^{-2 \Lambda} d \Phi / d r \\ & =8 \pi T_{\hat{r} \hat{r}}=8 \pi p . \end{aligned}Gr^r^=r2+r2e2Λ+2r1e2ΛdΦ/dr=8πTr^r^=8πp.
Solving this equation for the derivative of Φ Φ Phi\PhiΦ, and replacing e 2 Λ e 2 Λ e^(-2Lambda)e^{-2 \Lambda}e2Λ by 1 2 m / r 1 2 m / r 1-2m//r1-2 m / r12m/r, one obtains an expression for the gradient of the potential Φ Φ Phi\PhiΦ :
(23.21) d Φ d r = m + 4 π r 3 p r ( r 2 m ) . (23.21) d Φ d r = m + 4 π r 3 p r ( r 2 m ) . {:(23.21)(d Phi)/(dr)=(m+4pir^(3)p)/(r(r-2m)).:}\begin{equation*} \frac{d \Phi}{d r}=\frac{m+4 \pi r^{3} p}{r(r-2 m)} . \tag{23.21} \end{equation*}(23.21)dΦdr=m+4πr3pr(r2m).
This expression reduces to the familiar formula
(23.21~N) d Φ / d r = m / r 2 (23.21~N) d Φ / d r = m / r 2 {:(23.21~N)d Phi//dr=m//r^(2):}\begin{equation*} d \Phi / d r=m / r^{2} \tag{23.21~N} \end{equation*}(23.21~N)dΦ/dr=m/r2
in the Newtonian limit.
In most studies of stellar structure, one replaces equation (23.17) by the equivalent equation obtained with the help of (23.21),
(23.22) d p d r = ( ρ + p ) ( m + 4 π r 3 p ) r ( r 2 m ) (23.22) d p d r = ( ρ + p ) m + 4 π r 3 p r ( r 2 m ) {:(23.22)(dp)/(dr)=-((rho+p)(m+4pir^(3)p))/(r(r-2m)):}\begin{equation*} \frac{d p}{d r}=-\frac{(\rho+p)\left(m+4 \pi r^{3} p\right)}{r(r-2 m)} \tag{23.22} \end{equation*}(23.22)dpdr=(ρ+p)(m+4πr3p)r(r2m)
This is called the Oppenheimer-Volkoff ( OV ) ( OV ) (OV)(\mathrm{OV})(OV) equation of hydrostatic equilibrium. Its Newtonian limit,
(23.22~N) d p / d r = ρ m / r 2 (23.22~N) d p / d r = ρ m / r 2 {:(23.22~N)dp//dr=-rho m//r^(2):}\begin{equation*} d p / d r=-\rho m / r^{2} \tag{23.22~N} \end{equation*}(23.22~N)dp/dr=ρm/r2
is familiar.
Compare two stellar models, one relativistic and the other Newtonian. Suppose that at a given radius r r rrr [determined in both cases by (proper area) = 4 π r 2 = 4 π r 2 =4pir^(2)=4 \pi r^{2}=4πr2 ], the two configurations have the same values of ρ , p ρ , p rho,p\rho, pρ,p, and m m mmm. Then in the relativistic model the pressure gradient is
d p d ( proper radial distance ) = d p e A d r (23.23) = ( ρ + p ) ( m + 4 π r 3 p ) r 2 ( 1 2 m / r ) 1 / 2 . d p d (  proper radial distance  ) = d p e A d r (23.23) = ( ρ + p ) m + 4 π r 3 p r 2 ( 1 2 m / r ) 1 / 2 . {:[(dp)/(d(" proper radial distance "))=(dp)/(e^(A)dr)],[(23.23)=-((rho+p)(m+4pir^(3)p))/(r^(2)(1-2m//r)^(1//2)).]:}\begin{align*} \frac{d p}{d(\text { proper radial distance })} & =\frac{d p}{e^{A} d r} \\ & =-\frac{(\rho+p)\left(m+4 \pi r^{3} p\right)}{r^{2}(1-2 m / r)^{1 / 2}} . \tag{23.23} \end{align*}dpd( proper radial distance )=dpeAdr(23.23)=(ρ+p)(m+4πr3p)r2(12m/r)1/2.
In contrast, Newtonian theory gives for the pressure gradient
(23.23~N) d p d ( proper radial distance ) = d p d r = ρ m r 2 (23.23~N) d p d (  proper radial distance  ) = d p d r = ρ m r 2 {:(23.23~N)(dp)/(d(" proper radial distance "))=(dp)/(dr)=-(rho m)/(r^(2)):}\begin{equation*} \frac{d p}{d(\text { proper radial distance })}=\frac{d p}{d r}=-\frac{\rho m}{r^{2}} \tag{23.23~N} \end{equation*}(23.23~N)dpd( proper radial distance )=dpdr=ρmr2
The relativistic expression for the gradient is larger than the Newtonian expression (1) because the numerator is larger (added pressure term in both factors) and (2) because the denominator is smaller [shrinkage factor ( 1 2 m / r ) 1 / 2 ( 1 2 m / r ) 1 / 2 (1-2m//r)^(1//2)(1-2 m / r)^{1 / 2}(12m/r)1/2 ]. Therefore, as one proceeds deeper into the star, one finds pressure rising faster than Newtonian gravitation theory would predict. Moreover, this rise in pressure is in a certain sense "self-regenerative." The more the pressure goes up, the larger the pressure-correction terms become in the numerator of (23.23); and the larger these terms become, the faster is the further rise of the pressure as one probes still deeper into the star. The geometric factor [ 1 2 m ( r ) / r ] 1 / 2 [ 1 2 m ( r ) / r ] 1 / 2 [1-2m(r)//r]^(1//2)[1-2 m(r) / r]^{1 / 2}[12m(r)/r]1/2 in the denominator of (23.23) further augments this regenerative rise of pressure towards the center. It is appropriate to summarize the situation in short-hand terms by saying that general relativity predicts stronger gravitational forces in a stationary body than does Newtonian theory. These forces, among their other important effects, can pull certain white-dwarf stars and supermassive stars into gravitational collapse under circumstances (see Chapter 24) where Newtonian theory would have predicted stable hydrostatic equilibrium. As the most elementary indication that a new factor has surfaced in the analysis of stability, note that no star in hydrostatic equilibrium can ever have 2 m ( r ) / r 1 2 m ( r ) / r 1 2m(r)//r >= 12 m(r) / r \geq 12m(r)/r1 (see Box 23.2 for one illustration and § 23.8 § 23.8 §23.8\S 23.8§23.8 for discussion), a phenomenon alien to Newtonian theory.
Now in hand are five equations of structure [two equations of state (23.16); equation (23.19), expressing m ( r ) = 1 2 r ( 1 e 2 Λ ) m ( r ) = 1 2 r 1 e 2 Λ m(r)=(1)/(2)r(1-e^(-2Lambda))m(r)=\frac{1}{2} r\left(1-e^{-2 \Lambda}\right)m(r)=12r(1e2Λ) as a volume integral of ρ ρ rho\rhoρ; the source
Equation of hydrostatic equilibrium rewritten in "OV" form
Comparison of pressure gradients in Newtonian and relativistic stars
Equations of stellar structure summarized
equation (23.21) for Φ Φ Phi\PhiΦ; and the OV equation of hydrostatic equilibrium (23.22)] for the five structure functions ρ , p , n , Φ , Λ ρ , p , n , Φ , Λ rho,p,n,Phi,Lambda\rho, p, n, \Phi, \Lambdaρ,p,n,Φ,Λ. If the theory of relativistic stars as outlined above is well posed, then each of the remaining eight Einstein field equations G α ^ β ^ = 8 π T α ^ β ^ G α ^ β ^ = 8 π T α ^ β ^ G_( hat(alpha) hat(beta))=8piT_( hat(alpha) hat(beta))G_{\hat{\alpha} \hat{\beta}}=8 \pi T_{\hat{\alpha} \hat{\beta}}Gα^β^=8πTα^β^ must be either vacuous (" 0 = 0 0 = 0 0=00=00=0 "), or must be a consequence of the five equations of structure. This is, indeed, the case, as one can verify by straightforward but tedious computations.
To construct a stellar model, one needs boundary conditions as well as structure equations. To facilitate the presentation of boundary conditions, the next section will examine the star's external gravitational field.

EXERCISES

Exercise 23.3. LAW OF LOCAL ENERGY-MOMENTUM CONSERVATION (for readers who have not studied Chapter 22)
Evaluate the four components of the equation T α β ; β = 0 T α β ; β = 0 T^(alpha beta)_(;beta)=0T^{\alpha \beta}{ }_{; \beta}=0Tαβ;β=0 for the stress-energy tensor (23.14) in the Schwarzschild coordinate system of equation (23.7). [Answer: only T τ β ; β = 0 T τ β ; β = 0 T^(tau beta)_(;beta)=0T^{\tau \beta}{ }_{; \beta}=0Tτβ;β=0 gives a nonvacuous result; it gives equation (23.17).]

Exercise 23.4. EINSTEIN CURVATURE TENSOR

(for readers who have not studied Chapter 14)
Calculate the components of the Einstein curvature tensor, G α β G α β G_(alpha beta)G_{\alpha \beta}Gαβ, in Schwarzschild coordinates. Then perform a transformation to obtain G α ^ β ^ G α ^ β ^ G_( hat(alpha) hat(beta))G_{\hat{\alpha} \hat{\beta}}Gα^β^, the components in the orthonormal frame of equations (23.15a,b). [See Box 8.6, or Box 14.2 and equation (14.7).]

Exercise 23.5. TOTAL NUMBER OF BARYONS IN A STAR

Show that, if r = R r = R r=Rr=Rr=R is the location of the surface of a static star, then the total number of baryons inside the star is
(23.24) A = 0 R 4 π r 2 n e A d r (23.24) A = 0 R 4 π r 2 n e A d r {:(23.24)A=int_(0)^(R)4pir^(2)ne^(A)dr:}\begin{equation*} A=\int_{0}^{R} 4 \pi r^{2} n e^{A} d r \tag{23.24} \end{equation*}(23.24)A=0R4πr2neAdr
[Hint: See the discussion of m 0 m 0 m_(0)m_{0}m0 in Box 23.1.]

Exercise 23.6. BUOYANT FORCE IN A STAR

An observer at rest at some point inside a relativistic star measures the radial pressure-buoyant force, F buoy F buoy  F_("buoy ")F_{\text {buoy }}Fbuoy , on a small fluid element of volume V V VVV. Let him use the usual laboratory techniques. Do not confuse him by telling him he is in a relativistic star. What value will he find for F buoy F buoy  F_("buoy ")F_{\text {buoy }}Fbuoy , in terms of ρ , p , m , V ρ , p , m , V rho,p,m,V\rho, p, m, Vρ,p,m,V, and d p / d r d p / d r dp//drd p / d rdp/dr ? If he equates this buoyant force to an equal and opposite gravitational force, F grav F grav  F_("grav ")F_{\text {grav }}Fgrav , what will F grav F grav  F_("grav ")F_{\text {grav }}Fgrav  be in terms of ρ , p , m , V ρ , p , m , V rho,p,m,V\rho, p, m, Vρ,p,m,V, and r r rrr ? (Use equation 23.22.) How do these results differ from the corresponding Newtonian results?

Exercise 23.7. GRAVITATIONAL ENERGY OF A NEWTONIAN STAR

Calculate in Newtonian theory the energy one would gain from gravity if one were to construct a star by adding one spherical shell of matter on top of another, working from the inside outward. Use Laplace's equation ( r 2 Φ , r ) , r = 4 π r 2 ρ r 2 Φ , r , r = 4 π r 2 ρ (r^(2)Phi_(,r))_(,r)=4pir^(2)rho\left(r^{2} \Phi{ }_{, r}\right)_{, r}=4 \pi r^{2} \rho(r2Φ,r),r=4πr2ρ and the equation of hydrostatic equilibrium p , r = ρ Φ , r p , r = ρ Φ , r p_(,r)=-rhoPhi_(,r)p_{, r}=-\rho \Phi_{, r}p,r=ρΦ,r to put the answer in the following equivalent forms:
(energy gained from gravity) -=-\equiv- (gravitational potential energy)
= 0 R ( ρ r Φ , r ) 4 π r 2 d r = 0 R ( ρ m / r ) 4 π r 2 d r = 1 2 0 R ( ρ Φ ) 4 π r 2 d r = 1 8 π 0 ( Φ , r ) 2 4 π r 2 d r = 3 0 R 4 π r 2 p d r . = 0 R ρ r Φ , r 4 π r 2 d r = 0 R ( ρ m / r ) 4 π r 2 d r = 1 2 0 R ( ρ Φ ) 4 π r 2 d r = 1 8 π 0 Φ , r 2 4 π r 2 d r = 3 0 R 4 π r 2 p d r . {:[=int_(0)^(R)(rho rPhi_(,r))4pir^(2)dr=int_(0)^(R)(rho m//r)4pir^(2)dr],[=-(1)/(2)int_(0)^(R)(rho Phi)4pir^(2)dr=(1)/(8pi)int_(0)^(oo)(Phi_(,r))^(2)4pir^(2)dr],[=3int_(0)^(R)4pir^(2)pdr.]:}\begin{aligned} & =\int_{0}^{R}\left(\rho r \Phi_{, r}\right) 4 \pi r^{2} d r=\int_{0}^{R}(\rho m / r) 4 \pi r^{2} d r \\ & =-\frac{1}{2} \int_{0}^{R}(\rho \Phi) 4 \pi r^{2} d r=\frac{1}{8 \pi} \int_{0}^{\infty}\left(\Phi_{, r}\right)^{2} 4 \pi r^{2} d r \\ & =3 \int_{0}^{R} 4 \pi r^{2} p d r . \end{aligned}=0R(ρrΦ,r)4πr2dr=0R(ρm/r)4πr2dr=120R(ρΦ)4πr2dr=18π0(Φ,r)24πr2dr=30R4πr2pdr.

§23.6. EXTERNAL GRAVITATIONAL FIELD

Outside a star the density and pressure vanish, so only the metric parameters Φ Φ Phi\PhiΦ and Λ = 1 2 ln ( 1 2 m / r ) Λ = 1 2 ln ( 1 2 m / r ) Lambda=-(1)/(2)ln(1-2m//r)\Lambda=-\frac{1}{2} \ln (1-2 m / r)Λ=12ln(12m/r) need be considered. From equation (23.19) one sees that "the mass inside radius r r rrr, " m ( r ) m ( r ) m(r)m(r)m(r), stays constant for values of r r rrr greater than R R RRR (outside the star). Its constant value is denoted by M M MMM :
(23.25) m ( r ) = M for r > R (i.e., outside the star). (23.25) m ( r ) = M  for  r > R  (i.e., outside the star).  {:(23.25)m(r)=M quad" for "r > R" (i.e., outside the star). ":}\begin{equation*} m(r)=M \quad \text { for } r>R \text { (i.e., outside the star). } \tag{23.25} \end{equation*}(23.25)m(r)=M for r>R (i.e., outside the star). 
By integrating equation (23.21) with p = 0 p = 0 p=0p=0p=0 and m = M m = M m=Mm=Mm=M, and by imposing the boundary condition (23.10) on Φ Φ Phi\PhiΦ at r = r = r=oor=\inftyr= ("normalization of scale of time at r = r = r=oor=\inftyr= "), one finds
(23.26) Φ ( r ) = 1 2 ln ( 1 2 M / r ) for r > R . (23.26) Φ ( r ) = 1 2 ln ( 1 2 M / r )  for  r > R . {:(23.26)Phi(r)=(1)/(2)ln(1-2M//r)quad" for "r > R.:}\begin{equation*} \Phi(r)=\frac{1}{2} \ln (1-2 M / r) \quad \text { for } r>R . \tag{23.26} \end{equation*}(23.26)Φ(r)=12ln(12M/r) for r>R.
Consequently, outside the star the spacetime geometry (23.7) becomes
(23.27) d s 2 = ( 1 2 M r ) d t 2 + d r 2 ( 1 2 M / r ) + r 2 ( d θ 2 + sin 2 θ d ϕ 2 ) (23.27) d s 2 = 1 2 M r d t 2 + d r 2 ( 1 2 M / r ) + r 2 d θ 2 + sin 2 θ d ϕ 2 {:(23.27)ds^(2)=-(1-(2M)/(r))dt^(2)+(dr^(2))/((1-2M//r))+r^(2)(dtheta^(2)+sin^(2)theta dphi^(2)):}\begin{equation*} d s^{2}=-\left(1-\frac{2 M}{r}\right) d t^{2}+\frac{d r^{2}}{(1-2 M / r)}+r^{2}\left(d \theta^{2}+\sin ^{2} \theta d \phi^{2}\right) \tag{23.27} \end{equation*}(23.27)ds2=(12Mr)dt2+dr2(12M/r)+r2(dθ2+sin2θdϕ2)
This is called the "Schwarzschild geometry" or "Schwarzschild gratitational field" or "Schwarzschild line element," because Karl Schwarzschild (1916a) discovered it as an exact solution to Einstein's field equations a few months after Einstein formulated general relativity theory.
In that region of spacetime, r 2 M r 2 M r≫2Mr \gg 2 Mr2M, where the geometry is nearly flat, Newton's theory of gravity is valid, and the Newtonian potential is
(23.26~N) Φ = M / r for r > R , r 2 M . (23.26~N) Φ = M / r  for  r > R , r 2 M . {:(23.26~N)Phi=-M//r quad" for "r > R","r≫2M.:}\begin{equation*} \Phi=-M / r \quad \text { for } r>R, r \gg 2 M . \tag{23.26~N} \end{equation*}(23.26~N)Φ=M/r for r>R,r2M.
Consequently, M M MMM is the mass that governs the Keplerian motions of planets in the distant, Newtonian gravitational field-i.e., it is the star's "total mass-energy" (see Chapters 19 and 20). Since the metric (23.27) far outside the star is precisely diagonal ( g t j 0 ) g t j 0 (g_(tj)-=0)\left(g_{t j} \equiv 0\right)(gtj0), the star's total angular momentum must vanish. This result accords with the absence of internal fluid motions.
Spacetime outside star possesses "Schwarzschild" geometry

§23.7. HOW TO CONSTRUCT A STELLAR MODEL

Equations of stellar structure collected together
The equations of stellar structure (23.16), (23.19), (23.21), (23.22), and associated boundary conditions (to be discussed below), all gathered together along with the line element, read as follows.

Line Element

( ) d s 2 = e 2 ϕ d t 2 + d r 2 1 2 m / r + r 2 ( d θ 2 + sin 2 θ d ϕ 2 ) = ( 1 2 M r ) d t 2 + d r 2 1 2 M / r + r 2 ( d θ 2 + sin 2 θ d ϕ 2 ) for r > R . ( ) d s 2 = e 2 ϕ d t 2 + d r 2 1 2 m / r + r 2 d θ 2 + sin 2 θ d ϕ 2 = 1 2 M r d t 2 + d r 2 1 2 M / r + r 2 d θ 2 + sin 2 θ d ϕ 2  for  r > R . {:[('")"ds^(2)=-e^(2phi)dt^(2)+(dr^(2))/(1-2m//r)+r^(2)(dtheta^(2)+sin^(2)theta dphi^(2))],[=-(1-(2M)/(r))dt^(2)+(dr^(2))/(1-2M//r)+r^(2)(dtheta^(2)+sin^(2)theta dphi^(2))quad" for "r > R.]:}\begin{align*} d s^{2} & =-e^{2 \phi} d t^{2}+\frac{d r^{2}}{1-2 m / r}+r^{2}\left(d \theta^{2}+\sin ^{2} \theta d \phi^{2}\right) \tag{$\prime$}\\ & =-\left(1-\frac{2 M}{r}\right) d t^{2}+\frac{d r^{2}}{1-2 M / r}+r^{2}\left(d \theta^{2}+\sin ^{2} \theta d \phi^{2}\right) \quad \text { for } r>R . \end{align*}()ds2=e2ϕdt2+dr212m/r+r2(dθ2+sin2θdϕ2)=(12Mr)dt2+dr212M/r+r2(dθ2+sin2θdϕ2) for r>R.

Mass Equation

(23.28a) m = 0 r 4 π r 2 ρ d r , with m ( r = 0 ) = 0 (23.28a) m = 0 r 4 π r 2 ρ d r ,  with  m ( r = 0 ) = 0 {:(23.28a)m=int_(0)^(r)4pir^(2)rho dr","" with "m(r=0)=0:}\begin{equation*} m=\int_{0}^{r} 4 \pi r^{2} \rho d r, \text { with } m(r=0)=0 \tag{23.28a} \end{equation*}(23.28a)m=0r4πr2ρdr, with m(r=0)=0

OV Equation of Hydrostatic Equilibrium

(23.28b) d p d r = ( ρ + p ) ( m + 4 π r 3 p ) r ( r 2 m ) , with p ( r = 0 ) = p c = central pressure. (23.28b) d p d r = ( ρ + p ) m + 4 π r 3 p r ( r 2 m ) , with  p ( r = 0 ) = p c =  central pressure.  {:(23.28b)(dp)/(dr)=-((rho+p)(m+4pir^(3)p))/(r(r-2m))", with "p(r=0)=p_(c)=" central pressure. ":}\begin{equation*} \frac{d p}{d r}=-\frac{(\rho+p)\left(m+4 \pi r^{3} p\right)}{r(r-2 m)} \text {, with } p(r=0)=p_{c}=\text { central pressure. } \tag{23.28b} \end{equation*}(23.28b)dpdr=(ρ+p)(m+4πr3p)r(r2m), with p(r=0)=pc= central pressure. 

Equations of State

(23.28c) p = p ( n ) , (23.28d) ρ = ρ ( n ) . (23.28c) p = p ( n ) , (23.28d) ρ = ρ ( n ) . {:[(23.28c)p=p(n)","],[(23.28d)rho=rho(n).]:}\begin{align*} & p=p(n), \tag{23.28c}\\ & \rho=\rho(n) . \tag{23.28d} \end{align*}(23.28c)p=p(n),(23.28d)ρ=ρ(n).

Source Equation for Φ Φ Phi\PhiΦ

(23.28e) d Φ d r = ( m + 4 π r 3 p ) r ( r 2 m ) , with Φ ( r = R ) = 1 2 ln ( 1 2 M / R ) . (23.28e) d Φ d r = m + 4 π r 3 p r ( r 2 m ) ,  with  Φ ( r = R ) = 1 2 ln ( 1 2 M / R ) . {:(23.28e)(d Phi)/(dr)=((m+4pir^(3)p))/(r(r-2m))","quad" with "Phi(r=R)=(1)/(2)ln(1-2M//R).:}\begin{equation*} \frac{d \Phi}{d r}=\frac{\left(m+4 \pi r^{3} p\right)}{r(r-2 m)}, \quad \text { with } \Phi(r=R)=\frac{1}{2} \ln (1-2 M / R) . \tag{23.28e} \end{equation*}(23.28e)dΦdr=(m+4πr3p)r(r2m), with Φ(r=R)=12ln(12M/R).
To construct a stellar model one can proceed as follows. First specify the equations of state ( 23.28 c , d ) ( 23.28 c , d ) (23.28c,d)(23.28 \mathrm{c}, \mathrm{d})(23.28c,d) and a value of the central pressure, p c p c p_(c)p_{c}pc. Also specify an arbitrary (later to be renormalized) value, Φ 0 Φ 0 Phi_(0)\Phi_{0}Φ0, for Φ ( r = 0 ) Φ ( r = 0 ) Phi(r=0)\Phi(r=0)Φ(r=0). The boundary conditions p ( r = 0 ) = p c , Φ ( r = 0 ) = Φ 0 , m ( r = 0 ) = 0 p ( r = 0 ) = p c , Φ ( r = 0 ) = Φ 0 , m ( r = 0 ) = 0 p(r=0)=p_(c),Phi(r=0)=Phi_(0),m(r=0)=0p(r=0)=p_{c}, \Phi(r=0)=\Phi_{0}, m(r=0)=0p(r=0)=pc,Φ(r=0)=Φ0,m(r=0)=0 are sufficient to determine uniquely the solution to the coupled equations (23.28). Integrate these coupled equations outward from r = 0 r = 0 r=0r=0r=0 until the pressure vanishes. [The OV equation, (23.28b), guarantees that the pressure will decrease monotonically so long as the equations of state obey the
reasonable restriction ρ 0 ρ 0 rho >= 0\rho \geq 0ρ0 for all p 0 p 0 p >= 0p \geq 0p0.] The point at which the pressure reaches zero is the star's surface; the value of r r rrr there is the star's radius, R R RRR; and the value of m m mmm there is the star's total mass-energy, M M MMM. Having reached the surface, renormalize Φ Φ Phi\PhiΦ by adding a constant to it everywhere, so that it obeys the boundary condition (23.28e). The result is a relativistic stellar model whose structure functions Φ , m Φ , m Phi,m\Phi, mΦ,m, ρ , p , n ρ , p , n rho,p,n\rho, p, nρ,p,n satisfy the equations of structure.
Notice that for any fixed choice of the equations of state p = p ( n ) , ρ = ρ ( n ) p = p ( n ) , ρ = ρ ( n ) p=p(n),rho=rho(n)p=p(n), \rho=\rho(n)p=p(n),ρ=ρ(n), the stellar models form a one-parameter sequence (parameter p c p c p_(c)p_{c}pc ). Once the central pressure has been specified, the model is determined uniquely.
The next chapter describes a variety of realistic stellar models constructed numerically by the above prescription. For an idealized stellar model constructed analytically, see Box 23.2.

Exercise 23.8. NEWTONIAN STARS OF UNIFORM DENSITY

EXERCISE

Calculate the structures of uniform-density configurations in Newtonian theory. Show that the relativistic configurations of Box 23.2 become identical to the Newtonian configurations in the weak-gravity limit. Also show that there are no mass or radius limits in Newtonian theory.
(continued on page 612)

Box 23.2 RELATIVISTIC MODEL STAR OF UNIFORM DENSITY

For realistic equations of state (see next chapter), the equations of stellar structure (23.28) cannot be integrated analytically; numerical integration is necessary. However, analytic solutions exist for various idealized and ad hoc equations of state. One of the most useful analytic solutions [Karl Schwarzschild (1916b)] describes a star of uniform density,
(1) ρ = ρ 0 = constant for all p . (1) ρ = ρ 0 =  constant for all  p . {:(1)rho=rho_(0)=" constant for all "p.:}\begin{equation*} \rho=\rho_{0}=\text { constant for all } p . \tag{1} \end{equation*}(1)ρ=ρ0= constant for all p.
It is not necessary to indulge in the fiction of "an incompressible fluid" to accept this model as interesting. Incompressibility would imply a speed of sound, v = ( d p / d ρ ) 1 / 2 v = ( d p / d ρ ) 1 / 2 v=(dp//d rho)^(1//2)v=(d p / d \rho)^{1 / 2}v=(dp/dρ)1/2, of unlimited magnitude, therefore in excess of the speed of light, and therefore in contradiction with a central principle of special relativity ("principle of causality") that no physical effect can be propagated at a speed v > 1 v > 1 v > 1v>1v>1. (If a source could cause an effect so quickly in one local Lorentz frame, then there would exist another local Lorentz frame in which the effect would occur before the source had acted!) However, that the part of the fluid in the region of high pressure has the same density as the part of the fluid in the region of low pressure is an idea easy to admit, if only one thinks of the fluid having a composition that varies from one

Box 23.2 (continued)

r r rrr value to another ("hand-tailored"). Whether one thinks along this line, or simply has in mind a globe of water limited in size to a small fraction of the dimensions of the earth, one has in Schwarzschild's model an instructive example of hydrostatics done in the framework of Einstein's theory.
The mass equation (23.28a) gives immediately
(2) m = { ( 4 π / 3 ) ρ 0 r 3 for r < R M = ( 4 π / 3 ) ρ 0 R 3 for r > R } . (2) m = ( 4 π / 3 ) ρ 0 r 3  for  r < R M = ( 4 π / 3 ) ρ 0 R 3  for  r > R . {:(2)m={[(4pi//3)rho_(0^('))r^(3)," for "r < R],[M=(4pi//3)rho_(0)R^(3)," for "r > R]}.:}m=\left\{\begin{array}{ll} (4 \pi / 3) \rho_{0^{\prime}} r^{3} & \text { for } r<R \tag{2}\\ M=(4 \pi / 3) \rho_{0} R^{3} & \text { for } r>R \end{array}\right\} .(2)m={(4π/3)ρ0r3 for r<RM=(4π/3)ρ0R3 for r>R}.
from which follows the length-correction factor in the metric
(3) d ( proper distance ) d r = e Λ = [ 1 2 m ( r ) / r ] 1 / 2 . (3) d (  proper distance  ) d r = e Λ = [ 1 2 m ( r ) / r ] 1 / 2 . {:(3)(d(" proper distance "))/(dr)=e^(Lambda)=[1-2m(r)//r]^(-1//2).:}\begin{equation*} \frac{d(\text { proper distance })}{d r}=e^{\Lambda}=[1-2 m(r) / r]^{-1 / 2} . \tag{3} \end{equation*}(3)d( proper distance )dr=eΛ=[12m(r)/r]1/2.
When for ease of visualization the space geometry ( r , ϕ ) ( r , ϕ ) (r,phi)(r, \phi)(r,ϕ) of an equatorial slice through the star is viewed as embedded in a Euclidean 3-geometry ( z , r , ϕ ) ( z , r , ϕ ) (z,r,phi)(z, r, \phi)(z,r,ϕ) [see $ 23.8 ] $ 23.8 ] $23.8]\$ 23.8]$23.8], the "lift" out of the plane z = 0 z = 0 z=0z=0z=0 is
z ( r ) = { ( R 3 / 2 M ) 1 / 2 [ 1 ( 1 2 M r 2 / R 3 ) 1 / 2 ] for r R , ( R 3 / 2 M ) 1 / 2 [ 1 ( 1 2 M / R ) 1 / 2 ] + [ 8 M ( r 2 M ) ] 1 / 2 [ 8 M ( R 2 M ) ] 1 / 2 (4) for r R . z ( r ) = R 3 / 2 M 1 / 2 1 1 2 M r 2 / R 3 1 / 2  for  r R , R 3 / 2 M 1 / 2 1 ( 1 2 M / R ) 1 / 2 + [ 8 M ( r 2 M ) ] 1 / 2 [ 8 M ( R 2 M ) ] 1 / 2 (4)  for  r R {:[z(r)={[(R^(3)//2M)^(1//2)[1-(1-2Mr^(2)//R^(3))^(1//2)]quad" for "r <= R","],[(R^(3)//2M)^(1//2)[1-(1-2M//R)^(1//2)]+[8M(r-2M)]^(1//2)-[8M(R-2M)]^(1//2)]:}],[(4)" for "r >= R". "]:}\begin{align*} & z(r)=\left\{\begin{array}{l} \left(R^{3} / 2 M\right)^{1 / 2}\left[1-\left(1-2 M r^{2} / R^{3}\right)^{1 / 2}\right] \quad \text { for } r \leq R, \\ \left(R^{3} / 2 M\right)^{1 / 2}\left[1-(1-2 M / R)^{1 / 2}\right]+[8 M(r-2 M)]^{1 / 2}-[8 M(R-2 M)]^{1 / 2} \end{array}\right. \\ & \text { for } r \geq R \text {. } \tag{4} \end{align*}z(r)={(R3/2M)1/2[1(12Mr2/R3)1/2] for rR,(R3/2M)1/2[1(12M/R)1/2]+[8M(r2M)]1/2[8M(R2M)]1/2(4) for rR
The knowledge of m ( r ) m ( r ) m(r)m(r)m(r) from (2) allows the equation of hydrostatic equilibrium ( 23.28 b ) to be integrated to give the pressure:
(5) p = ρ 0 { ( 1 2 M r 2 / R 3 ) 1 / 2 ( 1 2 M / R ) 1 / 2 3 ( 1 2 M / R ) 1 / 2 ( 1 2 M r 2 / R 3 ) 1 / 2 } for r < R . (5) p = ρ 0 1 2 M r 2 / R 3 1 / 2 ( 1 2 M / R ) 1 / 2 3 ( 1 2 M / R ) 1 / 2 1 2 M r 2 / R 3 1 / 2  for  r < R . {:(5)p=rho_(0){((1-2Mr^(2)//R^(3))^(1//2)-(1-2M//R)^(1//2))/(3(1-2M//R)^(1//2)-(1-2Mr^(2)//R^(3))^(1//2))}" for "r < R.:}\begin{equation*} p=\rho_{0}\left\{\frac{\left(1-2 M r^{2} / R^{3}\right)^{1 / 2}-(1-2 M / R)^{1 / 2}}{3(1-2 M / R)^{1 / 2}-\left(1-2 M r^{2} / R^{3}\right)^{1 / 2}}\right\} \text { for } r<R . \tag{5} \end{equation*}(5)p=ρ0{(12Mr2/R3)1/2(12M/R)1/23(12M/R)1/2(12Mr2/R3)1/2} for r<R.
The pressure in turn leads via (23.28e) to the time-correction factor in the metric.
d (proper time) d t = e ϕ = { 3 2 ( 1 2 M R ) 1 / 2 1 2 ( 1 2 M r 2 R 3 ) 1 / 2 for r < R ( 1 2 M / r ) 1 / 2 for r > R } d  (proper time)  d t = e ϕ = 3 2 1 2 M R 1 / 2 1 2 1 2 M r 2 R 3 1 / 2       for  r < R ( 1 2 M / r ) 1 / 2       for  r > R (d" (proper time) ")/(dt)=e^(phi)={[(3)/(2)(1-(2M)/(R))^(1//2)-(1)/(2)(1-(2Mr^(2))/(R^(3)))^(1//2)," for "r < R],[(1-2M//r)^(1//2)," for "r > R]}\frac{d \text { (proper time) }}{d t}=e^{\phi}=\left\{\begin{array}{ll}\frac{3}{2}\left(1-\frac{2 M}{R}\right)^{1 / 2}-\frac{1}{2}\left(1-\frac{2 M r^{2}}{R^{3}}\right)^{1 / 2} & \text { for } r<R \\ (1-2 M / r)^{1 / 2} & \text { for } r>R\end{array}\right\}d (proper time) dt=eϕ={32(12MR)1/212(12Mr2R3)1/2 for r<R(12M/r)1/2 for r>R}.
Several features of these uniform-density configurations are noteworthy. (1) For fixed energy density, ρ 0 ρ 0 rho_(0)\rho_{0}ρ0, the central pressure
(7) p c = ρ 0 { 1 ( 1 2 M / R ) 1 / 2 3 ( 1 2 M / R ) 1 / 2 1 } , (7) p c = ρ 0 1 ( 1 2 M / R ) 1 / 2 3 ( 1 2 M / R ) 1 / 2 1 , {:(7)p_(c)=rho_(0){(1-(1-2M//R)^(1//2))/(3(1-2M//R)^(1//2)-1)}",":}\begin{equation*} p_{c}=\rho_{0}\left\{\frac{1-(1-2 M / R)^{1 / 2}}{3(1-2 M / R)^{1 / 2}-1}\right\}, \tag{7} \end{equation*}(7)pc=ρ0{1(12M/R)1/23(12M/R)1/21},
increases monotonically as the radius, R R RRR, increases-and, hence, also as the mass, M = ( 4 π / 3 ) ρ 0 R 3 M = ( 4 π / 3 ) ρ 0 R 3 M=(4pi//3)rho_(0)R^(3)M=(4 \pi / 3) \rho_{0} R^{3}M=(4π/3)ρ0R3, and the ratio ("strength of gravity")
(8) 2 M / R = ( 8 π / 3 ) ρ 0 R 2 (8) 2 M / R = ( 8 π / 3 ) ρ 0 R 2 {:(8)2M//R=(8pi//3)rho_(0)R^(2):}\begin{equation*} 2 M / R=(8 \pi / 3) \rho_{0} R^{2} \tag{8} \end{equation*}(8)2M/R=(8π/3)ρ0R2
increase. This is natural, since, as more and more matter is added to the star, a greater and greater pressure is required to support it. (2) The central pressure becomes infinite when M , R M , R M,RM, RM,R, and 2 M / R 2 M / R 2M//R2 M / R2M/R reach the limiting values
(9) R lim = ( 9 / 4 ) M lim = ( 3 π ρ 0 ) 1 / 2 , (10) ( 2 M / R ) lim = 8 / 9 . (9) R lim = ( 9 / 4 ) M lim = 3 π ρ 0 1 / 2 , (10) ( 2 M / R ) lim = 8 / 9 . {:[(9)R_(lim)=(9//4)M_(lim)=(3pirho_(0))^(-1//2)","],[(10)(2M//R)_(lim)=8//9.]:}\begin{align*} R_{\lim }= & (9 / 4) M_{\lim }=\left(3 \pi \rho_{0}\right)^{-1 / 2}, \tag{9}\\ & (2 M / R)_{\lim }=8 / 9 . \tag{10} \end{align*}(9)Rlim=(9/4)Mlim=(3πρ0)1/2,(10)(2M/R)lim=8/9.
No star of uniform density can have a mass and radius exceeding these limits. These limits are purely relativistic phenomena; no such limits occur in Newtonian theory. (3) Inside the star the space geometry (geometry of a hypersurface t = t = t=t=t= constant) is that of a three-dimensional spherical surface with radius of curvature
(11) a = ( 3 / 8 π ρ 0 ) 1 / 2 . (11) a = 3 / 8 π ρ 0 1 / 2 . {:(11)a=(3//8pirho_(0))^(1//2).:}\begin{equation*} a=\left(3 / 8 \pi \rho_{0}\right)^{1 / 2} . \tag{11} \end{equation*}(11)a=(3/8πρ0)1/2.
[See equation (4), above.] Outside the star the (Schwarzschild) space geometry is that of a three-dimensional paraboloid of revolution. The interior and exterior geometries join together smoothly. All these details are shown in the following three diagrams. There all quantities are given in the following geometric units (to convert mass in g or density in g / cm 3 g / cm 3 g//cm^(3)\mathrm{g} / \mathrm{cm}^{3}g/cm3 into mass in cm or density in cm 2 cm 2 cm^(-2)\mathrm{cm}^{-2}cm2, multiply by 0.742 × 10 28 cm / g ) 0.742 × 10 28 cm / g {: 0.742 xx10^(-28)(cm)//g)\left.0.742 \times 10^{-28} \mathrm{~cm} / \mathrm{g}\right)0.742×1028 cm/g) : lengths, in units ( 3 / 8 π ρ 0 ) 1 / 2 3 / 8 π ρ 0 1 / 2 (3//8pirho_(0))^(1//2)\left(3 / 8 \pi \rho_{0}\right)^{1 / 2}(3/8πρ0)1/2; pressure, in units ρ 0 ρ 0 rho_(0)\rho_{0}ρ0; mass, in units ( 3 / 32 π ρ 0 ) 1 / 2 3 / 32 π ρ 0 1 / 2 (3//32 pirho_(0))^(1//2)\left(3 / 32 \pi \rho_{0}\right)^{1 / 2}(3/32πρ0)1/2.

Box 23.2 (continued)

The mass "after assembly" is what is called M M MMM. The mass of the same fluid, dispersed in droplets at infinite separation, is called M before M before  M_("before ")M_{\text {before }}Mbefore  in the following table.
M before M before  M_("before ")M_{\text {before }}Mbefore  small 0.0882 0.894 1.0913 1.374
M M MMM small 0.0828 0.636 0.729 0.838 (critical)
Difference
(binding):
Difference (binding):| Difference | | :--- | | (binding): |
3 10 M 5 / 3 3 10 M 5 / 3 (3)/(10)M^(5//3)\frac{3}{10} M^{5 / 3}310M5/3 0.0054 0.258 0.362 0.536
M_("before ") small 0.0882 0.894 1.0913 1.374 M small 0.0828 0.636 0.729 0.838 (critical) "Difference (binding):" (3)/(10)M^(5//3) 0.0054 0.258 0.362 0.536| $M_{\text {before }}$ | small | 0.0882 | 0.894 | 1.0913 | 1.374 | | :--- | :--- | :--- | :--- | :--- | :--- | | $M$ | small | 0.0828 | 0.636 | 0.729 | 0.838 (critical) | | Difference <br> (binding): | $\frac{3}{10} M^{5 / 3}$ | 0.0054 | 0.258 | 0.362 | 0.536 |

§23.8. THE SPACETIME GEOMETRY FOR A STATIC STAR

Surface area of spheres, 4 π r 2 4 π r 2 4pir^(2)4 \pi r^{2}4πr2 :
(1) increases monotonically from center of star outward
For a highly relativistic star, the spacetime geometry departs strongly from EuclidLorentz flatness. Consequently, there is no a priori reason to expect that the surface area 4 π r 2 4 π r 2 4pir^(2)4 \pi r^{2}4πr2, and hence also the radial coordinate r r rrr, will increase monotonically as one moves from the center of the star outward. Fortunately, the equations of stellar structure guarantee that r will increase monotonically from 0 at the star's center to oo\infty at an infinite distance away from the star, so long as ρ 0 ρ 0 rho >= 0\rho \geq 0ρ0 and so long as the star is static (equilibrium).
The monotonicity of r r rrr can be seen as follows. Introduce as a new radial coordinate proper distance, \ell, from the center of the star. By virtue of expression (23.27') for the metric, \ell and r r rrr are related by
(23.29) d r = ± ( 1 2 m / r ) 1 / 2 d . (23.29) d r = ± ( 1 2 m / r ) 1 / 2 d . {:(23.29)dr=+-(1-2m//r)^(1//2)dℓ.:}\begin{equation*} d r= \pm(1-2 m / r)^{1 / 2} d \ell . \tag{23.29} \end{equation*}(23.29)dr=±(12m/r)1/2d.
Note that r r rrr is zero at the center of the star (where m r 3 m r 3 m propr^(3)m \propto r^{3}mr3 ), and note that r r rrr is always nonnegative by definition. Therefore r r rrr must at first increase with \ell as one moves outward from = 0 ; r ( ) = 0 ; r ( ) ℓ=0;r(ℓ)\ell=0 ; r(\ell)=0;r() can later reach a maximum and start decreasing only at a point where 2 m / r 2 m / r 2m//r2 m / r2m/r becomes unity [see equation (23.29)]. Such a behavior can and does happen in a closed model universe, a 3 -sphere of uniform density and radius a a aaa, where
r ( ) = a sin ( / a ) r ( ) = a sin ( / a ) r(ℓ)=a sin(ℓ//a)r(\ell)=a \sin (\ell / a)r()=asin(/a)
[see Chapter 27; especially the embedding diagram of Box 27.2(A)]. However, the field equations demand that such a system be dynamic. Here, on the contrary, attention is limited to a system where conditions are static. In such a system, the condition of hydrostatic equilibrium (23.28b) applies. Then the pressure gradient is given by an expression with the factor [ 1 2 m ( r ) / r ] [ 1 2 m ( r ) / r ] [1-2m(r)//r][1-2 m(r) / r][12m(r)/r] in its denominator. If 2 m / r 2 m / r 2m//r2 m / r2m/r approaches unity with increasing \ell in some region of the star, the pressure gradient
there becomes so large that one comes to the point p = 0 p = 0 p=0p=0p=0 (surface of the star) before one comes to any point where 2 m ( r ) / r 2 m ( r ) / r 2m(r)//r2 m(r) / r2m(r)/r might attain unit value. Moreover, after the surface of the star is passed, m m mmm remains constant, m ( r ) = M m ( r ) = M m(r)=Mm(r)=Mm(r)=M, and 2 m ( r ) / r 2 m ( r ) / r 2m(r)//r2 m(r) / r2m(r)/r decreases. Consequently, 2 m / r 2 m / r 2m//r2 m / r2m/r is always less than unity; and r ( ) r ( ) r(ℓ)r(\ell)r() cannot have a maximum, Q.e.D. (Details of the proof are left to the reader as exercise 23.9.)
Although the radii of curvature, r r rrr, and corresponding spherical surface areas, 4 π r 2 4 π r 2 4pir^(2)4 \pi r^{2}4πr2, increase monotonically from the center of a star outward, they do not increase at the same rate as they would in flat spacetime. In flat spacetime the rate of increase is given by d r / d ( d r / d ( dr//d(d r / d(dr/d( proper radial distance ) = d r / d = 1 ) = d r / d = 1 )=dr//dℓ=1)=d r / d \ell=1)=dr/d=1. In a star it is given by d r / d = ( 1 2 m / r ) 1 / 2 < 1 d r / d = ( 1 2 m / r ) 1 / 2 < 1 dr//dℓ=(1-2m//r)^(1//2) < 1d r / d \ell=(1-2 m / r)^{1 / 2}<1dr/d=(12m/r)1/2<1. Consequently, if one were to climb a long ladder outward from the center of a relativistic star, measuring for each successive spherical shell its Schwarzschild r r rrr-value ("proper circumference" / 2 π / 2 π //2pi/ 2 \pi/2π ), one would find these r r rrr-values to increase surprisingly slowly.
This strange behavior is most easily visualized by means of an "embedding diagram." It would be too much for any easy visualization if one were to attempt to embed the whole curved four-dimensional manifold in some higher-dimensional flat space. [See, however, Fronsdal (1959) and Clarke (1970) for a global embedding in 5 + 1 5 + 1 5+15+15+1 dimensions, and Kasner (1921b) for a local embedding in 4 + 2 4 + 2 4+24+24+2 dimensions. One can never embed a non-flat, vacuum metric ( G μ ν = 0 ) G μ ν = 0 (G_(mu nu)=0)\left(G_{\mu \nu}=0\right)(Gμν=0) in a flat space of 5 dimensions (Kasner, 1921c).] Therefore seek a simpler picture (Flamm 1916). Space at one time in the context of a static system has the same 3-geometry as space at another time. Therefore, depict 3 -space only as it is at one time, t = t = t=t=t= constant. Moreover, at any one time the space itself has spherical symmetry. Consequently, one slice through the center, r = 0 r = 0 r=0r=0r=0, that divides the space symmetrically into two halves (for example, the equatorial slice, θ = π / 2 θ = π / 2 theta=pi//2\theta=\pi / 2θ=π/2 ) has the same 2 -geometry as any other such slice (any selected angle of tilt, at any azimuth) through the center. Therefore limit attention to the 2 -geometry of the equatorial slice. The geometry on this slice is described by the line element
(23.30) d s 2 = [ 1 2 m ( r ) / r ] 1 d r 2 + r 2 d ϕ 2 (23.30) d s 2 = [ 1 2 m ( r ) / r ] 1 d r 2 + r 2 d ϕ 2 {:(23.30)ds^(2)=[1-2m(r)//r]^(-1)dr^(2)+r^(2)dphi^(2):}\begin{equation*} d s^{2}=[1-2 m(r) / r]^{-1} d r^{2}+r^{2} d \phi^{2} \tag{23.30} \end{equation*}(23.30)ds2=[12m(r)/r]1dr2+r2dϕ2
Now one may embed this two-dimensional curved-space geometry in the flat geometry of a Euclidean three-dimensional manifold.
If the curvature of the two-dimensional slice is zero or negligible, the embedding is trivial. In this event, identify the 2-geometry with the slice z = 0 z = 0 z=0z=0z=0 of the Euclidean 3 -space. Moreover, introduce into that 3 -space the familiar cylindrical coordinates z , r , ϕ z , r , ϕ z,r,phiz, r, \phiz,r,ϕ, that one employs for any problem with axial symmetry (see Fig. 23.1 and Box 23.2 for more detail). Then one recognizes the flat two-dimensional slice as the set of points of the Euclidean space with z = 0 z = 0 z=0z=0z=0, with ϕ ϕ phi\phiϕ running from 0 to 2 π 2 π 2pi2 \pi2π, and r r rrr from 0 to oo\infty. One has identified the r r rrr and ϕ ϕ phi\phiϕ of the slice with the r r rrr and ϕ ϕ phi\phiϕ of the Euclidean 3-space.
If the 2-geometry is curved, as it is when the equatorial section is taken through a real star, then maintain the identification between the r , ϕ r , ϕ r,phir, \phir,ϕ, of the slice and the r , ϕ r , ϕ r,phir, \phir,ϕ, of the Euclidean 3-geometry, but bend up the slice out of the plane z = 0 z = 0 z=0z=0z=0 (except at the origin, r = 0 r = 0 r=0r=0r=0 ). At the same time, insist that the bending be axially symmetric. In other words, require that the amount of the "lift" above the plane z = 0 z = 0 z=0z=0z=0 shall
(2) but increases more slowly than in flat spacetime
Embedding of spacetime in a flat space of higher dimensionality
Construction of "embedding diagram" for equatorial slice through star
Figure 23.1.
Geometry within (grey) and around (white) a star of radius R = 2.66 M R = 2.66 M R=2.66MR=2.66 \mathrm{M}R=2.66M, schematically displayed. The star is in hydrostatic equilibrium and has zero angular momentum (spherical symmetry). The twodimensional geometry
d s 2 = [ 1 2 m ( r ) / r ] 1 d r 2 + r 2 d ϕ 2 d s 2 = [ 1 2 m ( r ) / r ] 1 d r 2 + r 2 d ϕ 2 ds^(2)=[1-2m(r)//r]^(-1)dr^(2)+r^(2)dphi^(2)d s^{2}=[1-2 m(r) / r]^{-1} d r^{2}+r^{2} d \phi^{2}ds2=[12m(r)/r]1dr2+r2dϕ2
of an equatorial slice through the star ( θ = π / 2 , t = ( θ = π / 2 , t = (theta=pi//2,t=(\theta=\pi / 2, t=(θ=π/2,t= constant ) ) ))) is represented as embedded in Euclidean 3 -space, in such a way that distances between any two nearby points ( r , ϕ ) ( r , ϕ ) (r,phi)(r, \phi)(r,ϕ) and ( r + d r , ϕ + d ϕ r + d r , ϕ + d ϕ r+dr,phi+d phir+d r, \phi+d \phir+dr,ϕ+dϕ ) are correctly reproduced. Distances measured off the curved surface have no physical meaning; points off that surface have no physical meaning; and the Euclidean 3 -space itself has no physical meaning. Only the curved 2-geometry has meaning. A circle of Schwarzschild coordinate radius r r rrr has proper circumference 2 π r 2 π r 2pi r2 \pi r2πr (attention limited to equatorial plane of star, θ = π / 2 θ = π / 2 theta=pi//2\theta=\pi / 2θ=π/2 ). Replace this circle by a sphere of proper area 4 π r 2 4 π r 2 4pir^(2)4 \pi r^{2}4πr2, similarly for all the other circles, in order to visualize the entire 3-geometry in and around the star at any chosen moment of Schwarzschild coordinate time t t ttt. The factor [ 1 2 m ( r ) / r ] 1 [ 1 2 m ( r ) / r ] 1 [1-2m(r)//r]^(-1)[1-2 m(r) / r]^{-1}[12m(r)/r]1 develops no singularity as r r rrr decreases within r = 2 M r = 2 M r=2Mr=2 Mr=2M, because m ( r ) m ( r ) m(r)m(r)m(r) decreases sufficiently fast with decreasing r r rrr.
be independent of ϕ ϕ phi\phiϕ, whatever may be its dependence on r r rrr. Thus the whole story of the embedding is summarized by the single function, the lift,
z = z ( r ) ("embedding formula"). z = z ( r )  ("embedding formula").  z=z(r)" ("embedding formula"). "z=z(r) \text { ("embedding formula"). }z=z(r) ("embedding formula"). 
The geometry on this curved two-dimensional locus in Euclidean space (a made-up 3 -space; it has nothing whatever to do with the real world) is to be identical with the geometry of the two-dimensional equatorial slice through the actual star; in other words, the line elements in the two cases are to be identical. To work out this requirement in mathematical terms, write the line element in three-dimensional Euclidean space in the form
(23.31) d s 2 = d z 2 + d r 2 + r 2 d ϕ 2 (23.31) d s 2 = d z 2 + d r 2 + r 2 d ϕ 2 {:(23.31)ds^(2)=dz^(2)+dr^(2)+r^(2)dphi^(2):}\begin{equation*} d s^{2}=d z^{2}+d r^{2}+r^{2} d \phi^{2} \tag{23.31} \end{equation*}(23.31)ds2=dz2+dr2+r2dϕ2
Restrict to the chosen locus ("lifted surface") by writing z = z ( r ) z = z ( r ) z=z(r)z=z(r)z=z(r) or d z = ( d z / d r ) d r d z = ( d z / d r ) d r dz=(dz//dr)drd z=(d z / d r) d rdz=(dz/dr)dr. Thus have
(23.32) d s 2 = [ 1 + ( d z ( r ) d r ) 2 ] d r 2 + r 2 d ϕ 2 (23.32) d s 2 = 1 + d z ( r ) d r 2 d r 2 + r 2 d ϕ 2 {:(23.32)ds^(2)=[1+((dz(r))/(dr))^(2)]dr^(2)+r^(2)dphi^(2):}\begin{equation*} d s^{2}=\left[1+\left(\frac{d z(r)}{d r}\right)^{2}\right] d r^{2}+r^{2} d \phi^{2} \tag{23.32} \end{equation*}(23.32)ds2=[1+(dz(r)dr)2]dr2+r2dϕ2
on the two-dimensional locus in the 3-geometry, to be identified with
d s 2 = [ 1 2 m ( r ) / r ] 1 d r 2 + r 2 d ϕ 2 d s 2 = [ 1 2 m ( r ) / r ] 1 d r 2 + r 2 d ϕ 2 ds^(2)=[1-2m(r)//r]^(-1)dr^(2)+r^(2)dphi^(2)d s^{2}=[1-2 m(r) / r]^{-1} d r^{2}+r^{2} d \phi^{2}ds2=[12m(r)/r]1dr2+r2dϕ2
in the actual star. Compare and conclude
(23.33) ( d z ( r ) d r ) 2 + 1 = [ 1 2 m ( r ) / r ] 1 (23.33) d z ( r ) d r 2 + 1 = [ 1 2 m ( r ) / r ] 1 {:(23.33)((dz(r))/(dr))^(2)+1=[1-2m(r)//r]^(-1):}\begin{equation*} \left(\frac{d z(r)}{d r}\right)^{2}+1=[1-2 m(r) / r]^{-1} \tag{23.33} \end{equation*}(23.33)(dz(r)dr)2+1=[12m(r)/r]1
This equation is information enough to find the lift as a function of r r rrr; thus,
(23.34a) z ( r ) = 0 r d r [ r 2 m ( r ) 1 ] 1 / 2 everywhere, z ( r ) = [ 8 M ( r 2 M ) ] 1 / 2 + constant outside the star. (23.34a) z ( r ) = 0 r d r r 2 m ( r ) 1 1 / 2  everywhere,  z ( r ) = [ 8 M ( r 2 M ) ] 1 / 2 +  constant   outside the star.  {:(23.34a){:[z(r)=int_(0)^(r)(dr)/([(r)/(2m(r))-1]^(1//2))," everywhere, "],[z(r)=[8M(r-2M)]^(1//2)+" constant "," outside the star. "]:}:}\begin{array}{cc} z(r)=\int_{0}^{r} \frac{d r}{\left[\frac{r}{2 m(r)}-1\right]^{1 / 2}} & \text { everywhere, } \tag{23.34a}\\ z(r)=[8 M(r-2 M)]^{1 / 2}+\text { constant } & \text { outside the star. } \end{array}(23.34a)z(r)=0rdr[r2m(r)1]1/2 everywhere, z(r)=[8M(r2M)]1/2+ constant  outside the star. 
Outside the star this embedded surface is a segment of a paraboloid of revolution. Its form inside the star depends on how the mass, m m mmm, varies as a function of r r rrr. Recall
Description of embedded surface that m ( r ) m ( r ) m(r)m(r)m(r) varies as ( 4 π / 3 ) ρ c r 3 ( 4 π / 3 ) ρ c r 3 (4pi//3)rho_(c)r^(3)(4 \pi / 3) \rho_{c} r^{3}(4π/3)ρcr3 near the center of the star. Conclude that the embedded surface there looks like a segment of a sphere of radius a = ( 3 / 8 π ρ c ) 1 / 2 a = 3 / 8 π ρ c 1 / 2 a=(3//8pirho_(c))^(1//2)a=\left(3 / 8 \pi \rho_{c}\right)^{1 / 2}a=(3/8πρc)1/2; thus,
(23.34c) [ a z ( r ) ] 2 + r 2 = a 2 for r a = ( 3 / 8 π ρ c ) 1 / 2 (23.34c) [ a z ( r ) ] 2 + r 2 = a 2  for  r a = 3 / 8 π ρ c 1 / 2 {:(23.34c)[a-z(r)]^(2)+r^(2)=a^(2)quad" for "r≪a=(3//8pirho_(c))^(1//2):}\begin{equation*} [a-z(r)]^{2}+r^{2}=a^{2} \quad \text { for } r \ll a=\left(3 / 8 \pi \rho_{c}\right)^{1 / 2} \tag{23.34c} \end{equation*}(23.34c)[az(r)]2+r2=a2 for ra=(3/8πρc)1/2
In the special case of a star with uniform density (Box 23.2), the entire interior is of the spherical form (23.34c); in the general case it is not. In all cases, because r > 2 m ( r ) r > 2 m ( r ) r > 2m(r)r>2 m(r)r>2m(r), equation (23.34a) produces a surface with z z zzz and r r rrr as monotonically increasing functions of each other. This means that the embedded surface always opens upward and outward like a bowl; it always looks qualitatively like Figure 23.1; it never has a neck, and it never flattens out except asymptotically at r = r = r=oor=\inftyr=. At the star's surface, even though the density may drop discontinuously to zero ( ρ ρ rho\rhoρ finite inside when p = 0 ; ρ p = 0 ; ρ p=0;rhop=0 ; \rhop=0;ρ zero outside), the interior and exterior geometries will join together smoothly [ d z / d r d z / d r dz//drd z / d rdz/dr, as given by equation (23.33), is continuous].
It must be emphasized that only points lying on the embedded 2 -surface have physical significance so far as the stellar geometry is concerned: the three-dimensional regions inside and outside the bowl of Figure 23.1 are physically meaningless. So is the Euclidean embedding space. It merely permits one to visualize the geometry of space around the star in a convenient manner.

Exercise 23.9. GOOD BEHAVIOR OF r

EXERCISES

Carry out explicitly the full details of the proof, at the beginning of this section, that 2 m / r 2 m / r 2m//r2 \mathrm{~m} / r2 m/r is always less than unity and r r rrr is a monotonic function of \ell.

Exercise 23.10. CENTER OF STAR OCCUPIED BY IDEAL FERMI GAS AT EXTREME RELATIVISTIC LIMIT

Opposite to the idealization of a star built from an incompressible fluid is the idealization in which it is built from an ideal Fermi gas [ideal neutron star; see Oppenheimer and Volkoff (1939)] at zero temperature, so highly compressed that the particles have relativistic energies,
in comparison with which any rest mass they possess is negligible. In this limit, with two particles per occupied cell of volume h 3 h 3 h^(3)h^{3}h3 in phase space, one has
( number density of fermions ) = n = ( 2 / h 3 ) 4 π 0 p F p 2 d p = 8 π p F 3 / 3 h 3 , ( density of mass-energy ) = ρ = ( 2 / h 3 ) 4 π 0 p F c p p 2 d p = 2 π c p F 4 / h 3 , (  number density   of fermions  ) = n = 2 / h 3 4 π 0 p F p 2 d p = 8 π p F 3 / 3 h 3 , (  density of   mass-energy  ) = ρ = 2 / h 3 4 π 0 p F c p p 2 d p = 2 π c p F 4 / h 3 , {:[((" number density ")/(" of fermions "))=n=(2//h^(3))4piint_(0)^(p_(F))p^(2)dp=8pip_(F)^(3)//3h^(3)","],[((" density of ")/(" mass-energy "))=rho=(2//h^(3))4piint_(0)^(p_(F))cp*p^(2)dp=2pi cp_(F)^(4)//h^(3)","]:}\begin{aligned} & \binom{\text { number density }}{\text { of fermions }}=n=\left(2 / h^{3}\right) 4 \pi \int_{0}^{p_{\mathrm{F}}} p^{2} d p=8 \pi p_{\mathrm{F}}^{3} / 3 h^{3}, \\ & \binom{\text { density of }}{\text { mass-energy }}=\rho=\left(2 / h^{3}\right) 4 \pi \int_{0}^{p_{\mathrm{F}}} c p \cdot p^{2} d p=2 \pi c p_{\mathrm{F}}^{4} / h^{3}, \end{aligned}( number density  of fermions )=n=(2/h3)4π0pFp2dp=8πpF3/3h3,( density of  mass-energy )=ρ=(2/h3)4π0pFcpp2dp=2πcpF4/h3,
and finally
p = d ( energy per particle ) d ( volume per particle ) = d ( ρ / n ) d ( 1 / n ) = 2 π c p F 4 / 3 h 3 = ρ / 3 p = d (  energy   per particle  ) d (  volume   per particle  ) = d ( ρ / n ) d ( 1 / n ) = 2 π c p F 4 / 3 h 3 = ρ / 3 p=-(d((" energy ")/(" per particle ")))/(d((" volume ")/(" per particle ")))=-(d(rho//n))/(d(1//n))=2pi cp_(F)^(4)//3h^(3)=rho//3p=-\frac{d\binom{\text { energy }}{\text { per particle }}}{d\binom{\text { volume }}{\text { per particle }}}=-\frac{d(\rho / n)}{d(1 / n)}=2 \pi c p_{\mathrm{F}}^{4} / 3 h^{3}=\rho / 3p=d( energy  per particle )d( volume  per particle )=d(ρ/n)d(1/n)=2πcpF4/3h3=ρ/3
as if one were dealing with radiation instead of particles ( p F = p F = p_(F)=p_{F}=pF= Fermi momentum; momentum of highest occupied state).

Box 23.3 RIGOROUS DERIVATION OF THE SPHERICALLY SYMMETRIC LINE ELEMENT

Section 23.2 gave a heuristic derivation of the general spherically symmetric line element (23.7). This box attempts a more rigorous derivation, applicable to nonstatic systems, as well as static ones.
Begin with a manifold M 4 M 4 M^(4)M^{4}M4 on which a metric d s 2 d s 2 ds^(2)d s^{2}ds2 of Lorentz signature is defined. Assume M 4 M 4 M^(4)M^{4}M4 to be spherically symmetric in the sense that to any 3 × 3 3 × 3 3xx33 \times 33×3 rotation matrix A A AAA there corresponds a mapping (rotation) of M 4 M 4 M^(4)M^{4}M4, also called A ( A : M 4 A A : M 4 A(A:M^(4):}A\left(A: M^{4}\right.A(A:M4 M 4 : P A P M 4 : P A P longrightarrowM^(4):Plongrightarrow AP\longrightarrow M^{4}: \mathscr{P} \longrightarrow A \mathscr{P}M4:PAP ), that preserves the lengths of all curves. Further assumptions and constructions will be numbered (i), (ii), etc., so one can see what specializations are needed to get to the line element (23.7). Daggers ( \dagger ) indicate assumptions that are found inapplicable to some other physically interesting situations.
For any point P P P\mathscr{P}P, form the set s = S ( P ) = s = S ( P ) = s=S(P)=s=S(\mathscr{P})=s=S(P)= { A P M 4 A S O ( 3 ) } A P M 4 A S O ( 3 ) {APinM^(4)∣A in SO(3)}\left\{A \mathscr{P} \in M^{4} \mid A \in S O(3)\right\}{APM4ASO(3)} of all points equivalent to P P P\mathscr{P}P under rotations. Assume (i) \dagger that s s sss is a two-dimensional surface (except for center points, where s s sss is zero-dimensional), and (ii) that the metric on s s sss is that of a standard 2 -sphere. Then on s s sss one will have
(1) ( d s 2 ) s = R 2 ( s ) d Ω 2 (1) d s 2 s = R 2 ( s ) d Ω 2 {:(1)(ds^(2))_(s)=R^(2)(s)dOmega^(2):}\begin{equation*} \left(d s^{2}\right)_{s}=R^{2}(s) d \Omega^{2} \tag{1} \end{equation*}(1)(ds2)s=R2(s)dΩ2
where d Ω 2 d Ω 2 dOmega^(2)d \Omega^{2}dΩ2 is the standard metric of a unit sphere ( d Ω 2 = d θ 2 + sin 2 θ d ϕ 2 d Ω 2 = d θ 2 + sin 2 θ d ϕ 2 (dOmega^(2)=dtheta^(2)+sin^(2)theta dphi^(2):}\left(d \Omega^{2}=d \theta^{2}+\sin ^{2} \theta d \phi^{2}\right.(dΩ2=dθ2+sin2θdϕ2 for some θ , ϕ θ , ϕ theta,phi\theta, \phiθ,ϕ, defined on s ) s ) s)s)s), and where 2 π R 2 π R 2pi R2 \pi R2πR is the circumference of s s sss. If M 2 M 2 M^(2)M^{2}M2 is the set of all such surfaces s s sss, then S : M 4 S : M 4 S:M^(4)longrightarrowS: M^{4} \longrightarrowS:M4
M 2 : P s = S ( P ) M 2 : P s = S ( P ) M^(2):Plongrightarrow s=S(P)M^{2}: \mathscr{P} \longrightarrow s=S(\mathscr{P})M2:Ps=S(P) allows one to obtain, from R : M 2 R : s R ( s ) R : M 2 R : s R ( s ) R:M^(2)longrightarrowR:s longrightarrow R(s)R: M^{2} \longrightarrow \mathscr{R}: s \longrightarrow R(s)R:M2R:sR(s) [the "circumference" function on M 2 M 2 M^(2)M^{2}M2 as defined by equation (1)], a corresponding function R : M 4 : P R : M 4 : P R:M^(4)longrightarrowℜ:PlongrightarrowR: M^{4} \longrightarrow \Re: \mathscr{P} \longrightarrowR:M4:P R ( S ( P ) ) R ( S ( P ) ) R(S(P))R(S(\mathscr{P}))R(S(P)) on M 4 M 4 M^(4)M^{4}M4 which in some cases can eventually be used as a coordinate on M 4 M 4 M^(4)M^{4}M4. (Note: Ω Ω Omega\mathscr{\Omega}Ω denotes here the real numbers.)
Now assume (iii) \dagger there is a spherically symmetric 4 -velocity field u u u\boldsymbol{u}u, defined so that if P = C ( τ ) P = C ( τ ) P=C(tau)\mathscr{P}=\mathcal{C}(\tau)P=C(τ) is one trajectory of u u u\boldsymbol{u}u with u = d / d τ u = d / d τ u=d//d tau\boldsymbol{u}=d / d \tauu=d/dτ, then each curve P = A C ( τ ) P = A C ( τ ) P=AC(tau)\mathscr{P}=A \mathcal{C}(\tau)P=AC(τ) obtained by a rotation must also be a trajectory of u u u\boldsymbol{u}u. The orthogonal projection of u u u\boldsymbol{u}u onto any sphere s s sss must then vanish, as there are no rotation invariant non-zero vector fields on 2 -spheres. Thus u u u\boldsymbol{u}u is orthogonal to each s s sss. Also, if two trajectories of u u u\boldsymbol{u}u start on some same sphere s s sss, so C 1 ( 0 ) = A C 2 ( 0 ) C 1 ( 0 ) = A C 2 ( 0 ) C_(1)(0)=AC_(2)(0)\mathcal{C}_{1}(0)=A \mathcal{C}_{2}(0)C1(0)=AC2(0), then the same rotation A A AAA will always relate them, C 1 ( τ ) = A C 2 ( τ ) C 1 ( τ ) = A C 2 ( τ ) C_(1)(tau)=AC_(2)(tau)\mathcal{C}_{1}(\tau)=A \mathcal{C}_{2}(\tau)C1(τ)=AC2(τ), since trajectories are uniquely defined by any one point on them. Then S ( C 1 ( τ ) ) S C 1 ( τ ) S(C_(1)(tau))S\left(\mathcal{C}_{1}(\tau)\right)S(C1(τ)) and S ( C 2 ( τ ) ) S C 2 ( τ ) S(C_(2)(tau))S\left(\mathcal{C}_{2}(\tau)\right)S(C2(τ)) are both the same curve in M 2 M 2 M^(2)M^{2}M2, whose tangent d / d τ d / d τ d//d taud / d \taud/dτ one can call also u u u\boldsymbol{u}u; in this way one obtains a vector field u u u\boldsymbol{u}u on M 2 M 2 M^(2)M^{2}M2. Give each trajectory of u u u\boldsymbol{u}u on M 2 M 2 M^(2)M^{2}M2 a different label r r rrr to define a function r ( s ) r ( s ) r(s)r(s)r(s) on M 2 M 2 M^(2)M^{2}M2. Denote by r = r ( S ( P ) ) r = r ( S ( P ) ) r=r(S(P))r=r(S(\mathscr{P}))r=r(S(P)) a corresponding function r r rrr on M 4 M 4 M^(4)M^{4}M4 with d r / d τ = 0 d r / d τ = 0 dr//d tau=0d r / d \tau=0dr/dτ=0. Since functions and their gradients on M 4 M 4 M^(4)M^{4}M4 define corresponding quantities on M 2 M 2 M^(2)M^{2}M2, inner products such as d f d g d f d g df*dg\boldsymbol{d} f \cdot \boldsymbol{d} gdfdg can be defined on M 2 M 2 M^(2)M^{2}M2 by their values on M 4 M 4 M^(4)M^{4}M4; thus, from the metric on M 4 M 4 M^(4)M^{4}M4 one obtains a metric on M 2 M 2 M^(2)M^{2}M2. Then by equa-
(a) Write out the relativistic equation of hydrostatic equilibrium for a substance satisfying the equation of state p = ρ / 3 p = ρ / 3 p=rho//3p=\rho / 3p=ρ/3.
(b) Show that there exists a well-defined analytic solution for the limiting case of infinite central density, in which m ( r ) / r m ( r ) / r m(r)//rm(r) / rm(r)/r has the value 3 / 14 3 / 14 3//143 / 143/14.
(c) Find ρ ( r ) , p ( r ) ρ ( r ) , p ( r ) rho(r),p(r)\rho(r), p(r)ρ(r),p(r), and n ( r ) n ( r ) n(r)n(r)n(r).
(d) Show that the number of particles out to any finite r r rrr-value is finite, despite the fact that n ( r ) n ( r ) n(r)n(r)n(r) is infinite at the origin.
(e) Show that the 3-geometry has a "conical singularity" at r = 0 r = 0 r=0r=0r=0.
(f) Make an "embedding diagram" for this 3-geometry ["lift" z ( r ) z ( r ) z(r)z(r)z(r) as a function of r r rrr from (23.34)]. (Note that the conical singularity at r = 0 r = 0 r=0r=0r=0, otherwise physically unreasonable, arises because the density of mass-energy goes to infinity at that point. Note also that the calculated mass of the system diverges to infinity as r r r longrightarrow oor \longrightarrow \inftyr. In actuality with decreasing density the Fermi momentum falls from relativistic to nonrelativistic values, the equation of state changes its mathematical form, and the total mass M M MMM converges to a finite value).
tion (23.5) or equivalently by drawing curves in M 2 M 2 M^(2)M^{2}M2 orthogonal to the r = r = r=r=r= const. lines, and giving each a different label t t ttt, one obtains coordinates with g r t = d r d t = 0 g r t = d r d t = 0 g^(rt)=dr*dt=0g^{r t}=\boldsymbol{d} r \cdot \boldsymbol{d} t=0grt=drdt=0. Both r r rrr and t t ttt labels were assigned arbitrarily on the corresponding curves, so it is clear that transformations t = t ( t ) t = t ( t ) t^(')=t^(')(t)t^{\prime}=t^{\prime}(t)t=t(t) and r = r ( r ) r = r ( r ) r^(')=r^(')(r)r^{\prime}=r^{\prime}(r)r=r(r) are not excluded.
On one 2-sphere s s sss in M 4 M 4 M^(4)M^{4}M4, on the t = 0 t = 0 t=0t=0t=0 hypersurface, choose a set of θ , ϕ θ , ϕ theta,phi\theta, \phiθ,ϕ coordinates by picking the pole ( θ = 0 ) ( θ = 0 ) (theta=0)(\theta=0)(θ=0) and the prime meridian ( ϕ = 0 ) ( ϕ = 0 ) (phi=0)(\phi=0)(ϕ=0) arbitrarily. Then extend the definition of θ , ϕ θ , ϕ theta,phi\theta, \phiθ,ϕ, over the t = 0 t = 0 t=0t=0t=0 hypersurface by requiring θ θ theta\thetaθ and ϕ ϕ phi\phiϕ to be constant on curves orthogonal to each 2 -sphere s s sss, i.e., by demanding that ( / r ) θ ϕ ( / r ) θ ϕ (del//del r)_(theta phi)(\partial / \partial r)_{\theta \phi}(/r)θϕ be orthogonal to each s s sss at t = 0 t = 0 t=0t=0t=0. Extend the definition of θ θ theta\thetaθ and ϕ ϕ phi\phiϕ to t 0 t 0 t!=0t \neq 0t0 by requiring them to be constant on curves with tangent u u u\boldsymbol{u}u, so ( / t ) r θ ϕ u ( / t ) r θ ϕ u (del//del t)_(r theta phi)prop u(\partial / \partial t)_{r \theta \phi} \propto \boldsymbol{u}(/t)rθϕu. But each s s sss is a surface of constant r r rrr and t t ttt; so ( / θ ) r t ϕ ( / θ ) r t ϕ (del//del theta)_(rt phi)(\partial / \partial \theta)_{r t \phi}(/θ)rtϕ and ( / ϕ ) r t θ ( / ϕ ) r t θ (del//del phi)_(rt theta)(\partial / \partial \phi)_{r t \theta}(/ϕ)rtθ are tangent to s s sss, while u ( / t ) u ( / t ) u prop(del//del t)\boldsymbol{u} \propto(\partial / \partial t)u(/t) is orthogonal to each s s sss. Consequently,
and
(2) g t θ = ( / t ) ( / θ ) = 0 (2) g t θ = ( / t ) ( / θ ) = 0 {:(2)g_(t theta)=(del//del t)*(del//del theta)=0:}\begin{equation*} g_{t \theta}=(\partial / \partial t) \cdot(\partial / \partial \theta)=0 \tag{2} \end{equation*}(2)gtθ=(/t)(/θ)=0
(3) g t ϕ = ( / t ) ( / ϕ ) = 0 (3) g t ϕ = ( / t ) ( / ϕ ) = 0 {:(3)g_(t phi)=(del//del t)*(del//del phi)=0:}\begin{equation*} g_{t \phi}=(\partial / \partial t) \cdot(\partial / \partial \phi)=0 \tag{3} \end{equation*}(3)gtϕ=(/t)(/ϕ)=0
in the t r θ ϕ ˙ t r θ ϕ ˙ tr thetaphi^(˙)t r \theta \dot{\phi}trθϕ˙ coordinate system just constructed. The vector ( / r ) t θ ϕ ( / r ) t θ ϕ (del//del r)_(t theta phi)(\partial / \partial r)_{t \theta \phi}(/r)tθϕ does not depend on the arbitrary directions introduced in the original choice of θ , ϕ θ , ϕ theta,phi\theta, \phiθ,ϕ coordinates on one sphere s s sss; it is invariant under transformations θ = θ ( θ , ϕ ) , ϕ = ϕ ( θ , ϕ ) θ = θ θ , ϕ , ϕ = ϕ θ , ϕ theta=theta(theta^('),phi^(')),phi=phi(theta^('),phi^('))\theta=\theta\left(\theta^{\prime}, \phi^{\prime}\right), \phi=\phi\left(\theta^{\prime}, \phi^{\prime}\right)θ=θ(θ,ϕ),ϕ=ϕ(θ,ϕ). But nothing except θ θ theta\thetaθ and ϕ ϕ phi\phiϕ introduced nonrotationally invariant elements into the discussion; so ( / r ) t θ ϕ ( / r ) t θ ϕ (del//del r)_(t theta phi)(\partial / \partial r)_{t \theta \phi}(/r)tθϕ must be a rotationally invariant vector field (un-
like, say, / ϕ ) / ϕ ) del//del phi)\partial / \partial \phi)/ϕ); so it is, like u u u\boldsymbol{u}u, orthogonal to each 2 -sphere s s sss. This invariance then gives
(4) g r θ = ( / r ) ( / θ ) = 0 , (5) g r ϕ = ( / r ) ( / ϕ ) = 0 , (4) g r θ = ( / r ) ( / θ ) = 0 , (5) g r ϕ = ( / r ) ( / ϕ ) = 0 , {:[(4)g_(r theta)=(del//del r)*(del//del theta)=0","],[(5)g_(r phi)=(del//del r)*(del//del phi)=0","]:}\begin{align*} & g_{r \theta}=(\partial / \partial r) \cdot(\partial / \partial \theta)=0, \tag{4}\\ & g_{r \phi}=(\partial / \partial r) \cdot(\partial / \partial \phi)=0, \tag{5} \end{align*}(4)grθ=(/r)(/θ)=0,(5)grϕ=(/r)(/ϕ)=0,
which, with g t r = 0 g t r = 0 g^(tr)=0g^{t r}=0gtr=0 as previously established, gives g t r = 0 g t r = 0 g_(tr)=0g_{t r}=0gtr=0. The result is a line element of the form (23.3). Further specialization, a change of radial and time coordinates to R R RRR and T T TTT, where R R RRR is defined by (1) above and
d T = e ψ [ 1 g r r R r d t 1 g t t R t d r ] , e ψ = ( integrating factor ) , d T = e ψ 1 g r r R r d t 1 g t t R t d r , e ψ = (  integrating   factor  ) , {:[dT=e^(psi)[(1)/(g_(rr))(del R)/(del r)dt-(1)/(g_(tt))(del R)/(del t)dr]","],[e^(psi)=((" integrating ")/(" factor "))","]:}\begin{aligned} \boldsymbol{d} T & =e^{\psi}\left[\frac{1}{g_{r r}} \frac{\partial R}{\partial r} \boldsymbol{d} t-\frac{1}{g_{t t}} \frac{\partial R}{\partial t} \boldsymbol{d} r\right], \\ e^{\psi} & =\binom{\text { integrating }}{\text { factor }}, \end{aligned}dT=eψ[1grrRrdt1gttRtdr],eψ=( integrating  factor ),
followed by a change of notation, leads to Schwarzschild coordinates and the line element (23.7)-though such a transformation is possible (i.e., nonsingular) only where d R d T 0 d R d T 0 dR^^dT!=0\boldsymbol{d} R \wedge \boldsymbol{d} T \neq 0dRdT0 :
( R ) 2 = ( R / t ) 2 g t t + ( R / r ) 2 g π r 0 ( R ) 2 = ( R / t ) 2 g t t + ( R / r ) 2 g π r 0 (grad R)^(2)=((del R//del t)^(2))/(g_(tt))+((del R//del r)^(2))/(g_(pi r))!=0(\nabla R)^{2}=\frac{(\partial R / \partial t)^{2}}{g_{t t}}+\frac{(\partial R / \partial r)^{2}}{g_{\pi r}} \neq 0(R)2=(R/t)2gtt+(R/r)2gπr0
If (iv) \dagger spacetime is asymptotically flat, so r r r longrightarrow oor \longrightarrow \inftyr is a region where the metric can take on its special relativity values, then the arbitrariness in the t t ttt coordinate, t = t ( t ) t = t ( t ) t^(')=t^(')(t)t^{\prime}=t^{\prime}(t)t=t(t), can be eliminated by requiring g t t = 1 g t t = 1 g_(tt)=-1g_{t t}=-1gtt=1 as r r r longrightarrow oor \longrightarrow \inftyr. Then ( / t ) r θ ϕ ( / t ) r θ ϕ (del//del t)_(r theta phi)(\partial / \partial t)_{r \theta \phi}(/t)rθϕ is uniquely determined by natural requirements (independent of the arbitrary θ , ϕ θ , ϕ theta,phi\theta, \phiθ,ϕ, choices), and whenever it is desired to make the further physical assumption (v) \dagger of a time-independent geometry, this can be appropriately restated as g μ ν / t = 0 g μ ν / t = 0 delg_(mu nu)//del t=0\partial g_{\mu \nu} / \partial t=0gμν/t=0.

anme 24

PULSARS AND NEUTRON STARS; QUASARS AND SUPERMASSIVE STARS

Go, wond'rous creature, mount where Science guides, Go, measure earth, weigh air, and state the tides;
Instruct the planets in what orbs to run,
Correct old time, and regulate the sun.
ALEXANDER POPE (1733)

§24.1. OVERVIEW

Types of stellar configurations where relativity should be important
Five kinds of stellar configurations are recognized in which relativistic effects should be significant: white dwarfs, neutron stars, black holes, supermassive stars, and relativistic star clusters. The key facts about each type of configuration are summarized in Box 24.1; and the most important details are described in the text of this chapter (white dwarfs in § 24.2 § 24.2 §24.2\S 24.2§24.2; neutron stars and their connection to pulsars in § § 24.2 § § 24.2 §§24.2\S \S 24.2§§24.2 and 24.3 ; supermassive stars and their possible connection to quasars and galactic nuclei in § § 24.4 § § 24.4 §§24.4\S \S 24.4§§24.4 and 24.5 ; and relativistic star clusters in § 24.6 § 24.6 §24.6\S 24.6§24.6; a detailed discussion of black holes is delayed until Chapter 33).
The book Stars and Relativity by Zel'dovich and Novikov (1971) presents a clear and very complete treatment of all these astrophysical applications of relativistic stellar theory. In a sense, that book can be regarded as a companion volume to this one; it picks up, with astrophysical emphasis, all the topics that this book treats with gravitational emphasis. This chapter is meant only to give the reader a brief survey of the material to be found in Stars and Relativity.
(continued on page 621)
Box 24.1. STELLAR CONFIGURATIONS WHERE RELATIVISTIC EFFECTS ARE IMPORTANT
[For detailed analyses and references on all these topics, see Zel'dovich and Novikov (1971).]

A. White Dwarf Stars

Are stars of about one solar mass, with radii about 5,000 kilometers and densities about 10 6 g / cm 3 1 10 6 g / cm 3 1 10^(6)g//cm^(3)∼110^{6} \mathrm{~g} / \mathrm{cm}^{3} \sim 1106 g/cm31 ton / cm 3 / cm 3 //cm^(3)/ \mathrm{cm}^{3}/cm3; support themselves against gravity by the pressure of degenerate electrons; have stopped burning nuclear fuel, and are gradually cooling as they radiate away their remaining store of thermal energy.
Were observed and studied astronomically long before they were understood theoretically.
Key points in history:
August 1926, Dirac (1926) formulated FermiDirac statistics, following Fermi (February). December 1926, R. H. Fowler (1926) used Fermi-Dirac statistics to explain the nature of white dwarfs; he invoked electron degeneracy pressure to hold the star out against the inward pull of gravity.
1930, S. Chandrasekhar (1931a,b) calculated white-dwarf models taking account of special relativistic effects in the electron-degeneracy equation of state; he discovered that no white dwarf can be more massive than 1.2 1.2 ∼1.2\sim 1.21.2 solar masses ("Chandrasekhar Limit").
1932, L. D. Landau (1932) gave an elementary explanation of the Chandrasekhar limit.
1949, S. A. Kaplan (1949) derived the effects of general relativity on the mass-radius curve for massive white dwarfs, and deduced that general relativity probably induces an instability when the radius becomes smaller than 1.1 × 10 3 km 1.1 × 10 3 km 1.1 xx10^(3)km1.1 \times 10^{3} \mathrm{~km}1.1×103 km.
Role of general relativity in white dwarfs:
negligible influence on structure;
significant influence on stability, on pulsation
frequencies, and on form of mass-radius curve near the Chandrasekhar limit (i.e., in massive white dwarfs). Electron capture also significant. See, e.g., Zel'dovich and Novikov (1971); Faulkner and Gribbin (1968).

B. Neutron Stars

Are stars of about one solar mass, with radii about 10 km and densities about 10 14 g / cm 3 10 14 g / cm 3 10^(14)g//cm^(3)10^{14} \mathrm{~g} / \mathrm{cm}^{3}1014 g/cm3 (same as density of an atomic nucleus); are supported against gravity by the pressure of degenerate neutrons and by nucleon-nucleon strong-interaction forces; are not burning nuclear fuel; the energy being radiated is the energy of rotation and the remaining store of internal thermal energy.
Theoretical calculations predicted their existence in 1934, but they were not verified to exist observationally until 1968.
Key points in history:
1932, neutron discovered by Chadwick (1932). 1933-34, Baade and Zwicky (1934a,b,c) (1) invented the concept of neutron star; (2) identified a new class of astronomical objects which they called "supernovae"; (3) suggested that supernovae might be created by the collapse of a normal star to form a neutron star. (See Figure 24.1.)
1939, Oppenheimer and Volkoff (1939) performed the first detailed calculations of the structures of neutron stars; in the process, they laid the foundations of the general relativistic theory of stellar structure as presented in Chapter 23. (See Figure 24.1.)
1942, Duyvendak (1942) and Mayall and Oort (1942) deduced that the Crab nebula is a remnant of the supernova observed by Chi-
Box 24.1 (continued)
nese astronomers in A.D. 1054. Baade (1942) and Minkowskii (1942) identified the "south preceding star," near the center of the Crab Nebula, as probably the (collapsed) remnant of the star that exploded in 1054 (see frontispiece).
1967, Pulsars were discovered by Hewish et al. (1968).
1968, Gold (1968) advanced the idea that pulsars are rotating neutron stars; and subsequent observations confirmed this suggestion.
1969, Cocke, Disney, and Taylor (1969) discovered that the "south preceding star" of the Crab nebula is a pulsar, thereby clinching the connection between supernovae, neutron stars, and pulsars.
Role of general relativity in neutron stars:
significant effects (as much as a factor of 2 ) on structure and vibration periods;
gravitational radiation reaction may be the dominant force that damps nonradial vibrations.

C. Black Holes

Are objects created when a star collapses to a size smaller than twice its geometrized mass ( R < 2 M ( M / M ) × 3 km ) R < 2 M M / M × 3 km (R < 2M∼(M//M_(o.))xx3(km))\left(R<2 M \sim\left(M / M_{\odot}\right) \times 3 \mathrm{~km}\right)(R<2M(M/M)×3 km), thereby creating such strong spacetime curvatures that it can no longer communicate with the external universe (detailed analysis of black holes in Chapters 33 and 34).
No one who accepts general relativity has found any way to escape the prediction that black holes must exist in our galaxy. This prediction depends in no way on the complexity of the collapse that forms the black holes, or on unknown properties of matter at high density. However, the existence of black holes has not yet been verified observationally.
Key points in history:
1795, Laplace (1795) noted that, according to Newtonian gravity and Newton's corpuscular theory of light, light cannot escape from a sufficiently massive object (Figure 24.1).
1939, Oppenheimer and Snyder (1939) calculated the collapse of a homogeneous sphere of pressure-free fluid, using general relativity, and discovered that the sphere cuts itself off from communication with the rest of the universe. This was the first calculation of how a black hole can form (Figure 24.1).
1965, Beginning of an era of intensive theoretical investigation of black-hole physics. Role of general relativity in black-hole physics: No sensible account of black holes possible in Newtonian theory. The physics of black holes calls on Einstein's description of gravity from beginning to end.

D. Supermassive Stars

Are stars of mass between 10 3 10 3 10^(3)10^{3}103 and 10 9 10 9 10^(9)10^{9}109 solar masses, constructed from a hot plasma of density typically less than that in normal stars; are supported primarily by the pressure of photons, which are trapped in the plasma and are in thermal equilibrium with it; burn nuclear fuel (hydrogen) at some stages in their evolution.
Theoretical calculations suggest (but not with complete confidence) that supermassive stars exist in the centers of galaxies and quasars, and perhaps elsewhere. Supermassive stars conceivably could be the energy sources for some quasars and galactic nuclei. However, astronomical observations have not yet yielded definitive evidence about their existence or their roles in the universe if they do exist.
Key points in history:
1963, Hoyle and Fowler (1963a,b) conceived the idea of supermassive stars, calculated their properties, and suggested that they might be associated with galactic nuclei and quasars.
1963-64, Chandrasekhar (1964a,b) and Feynman (1964) developed the general relativistic theory of stellar pulsations; and Feynman used it to show that supermassive stars, although Newtonian in structure, are subject to a general-relativistic instability.
1964 and after, calculations by many workers have elaborated on and extended the ideas of Hoyle and Fowler, but have not produced any spectacular breakthrough.
Role of general relativity in supermassive stars: negligible influence on structure, except in the extreme case of a compact, rapidly rotating, disc-like configuration [see Bardeen and Wagoner (1971); Salpeter and Wagoner (1971)].
significant influence on stability.

E. Relativistic Star Clusters

Are clusters of stars so dense that relativistic corrections to Newtonian theory modify their structure.
Theoretical calculations suggest that relativistic star clusters might, but quite possibly do not, form in the nuclei of some galaxies and quasars; if they do try to form, they might be destroyed during formation by star-star collisions, which convert the cluster into supermassive stars or into a dense conglomerate of stars and gas. Astronomical observations have yielded no definitive evidence, as yet, about the existence of relativistic clusters.
Key points in history:
1965, Zel'dovich and Podurets (1965) conceived the idea of relativistic star clusters, developed the theory of their structure using general relativity and kinetic theory (cf. $ 25.7 $ 25.7 $25.7\$ 25.7$25.7 ), and speculated about their stability.
1968, Ipser (1969) developed the theory of star-cluster stability and showed (in agreement with the Zel'dovich-Podurets speculations) that, when it becomes too dense, a cluster begins to collapse to form a black hole.
Role of general relativity in star clusters:
significant effect on structure when gravitational redshift from center to infinity exceeds z c Δ λ / λ 0.05 z c Δ λ / λ 0.05 z_(c)-=Delta lambda//lambda∼0.05z_{c} \equiv \Delta \lambda / \lambda \sim 0.05zcΔλ/λ0.05.
induces collapse of cluster to form black hole when central redshift reaches z c 0.50 z c 0.50 z_(c)~~0.50z_{c} \approx 0.50zc0.50.

§24.2. THE ENDPOINT OF STELLAR EVOLUTION

After the normal stages of evolution, stars "die" by a variety of processes. Some stars explode, scattering themselves into the interstellar medium; others contract into a white-dwarf state; and others-according to current theory-collapse to a neutron-star state, or beyond, into a black hole. Although one knows little at present about a star's dynamic evolution into its final state, much is known about the final states themselves. The final states include dispersed nebulae, which are of no interest here; cold stellar configurations, the subject of this section; and "black holes," the subject of Part VII.

Minutes of the Stanford Meeting, Decemper 15-16, 1933

  1. Supernovae and Cosmic Rays. W. BaAde, Mt. Wilson Observatory, aND F. flare up in every stellar system of Technology. - Supernovae flare up ine lifetime of a super(nebula) once in severy days and its absolute brightness at nova is about maximum may be as high as M vis = 14 M . The visible matimes the  maximum may be as high as  M vis  = 14 M . The visible   matimes the  {:[" maximum may be as high as "M_("vis ")=-14^(M)". The visible "],[" matimes the "]:}\begin{aligned} & \text { maximum may be as high as } M_{\text {vis }}=-14^{M} \text {. The visible } \\ & \text { matimes the }\end{aligned} maximum may be as high as Mvis =14M. The visible  matimes the  madiation L ν L ν L_(nu)L_{\nu}Lν of a supernova is about 10 8 10 8 10^(8)10^{8}108 times the radiation of our sun, that is, L ν = 3.78 × 10 4 L ν = 3.78 × 10 4 L_(nu)=3.78 xx10^(4)L_{\nu}=3.78 \times 10^{4}Lν=3.78×104 visible and invisible, is indicate that the total radiation, 48 ergs / sec 48 ergs / sec ^(48)ergs//sec{ }^{48} \mathrm{ergs} / \mathrm{sec}48ergs/sec. The superof the order L τ = 10 7 L ν = 3.18 × L τ = 10 7 L ν = 3.18 × L_(tau)=10^(7)L_(nu)=3.18 xxL_{\tau}=10^{7} L_{\nu}=3.18 \timesLτ=107Lν=3.18× its life a total energy
    quite ordinary stars of mass M < 10 34 g , E τ / c 2 M < 10 34 g , E τ / c 2 M < 10^(34)g,E_(tau)//c^(2)M<10^{34} \mathrm{~g}, E_{\tau} / c^{2}M<1034 g,Eτ/c2 is of the same order as M M MMM itself. In the supernovapothesis suggests sulk is annihilated. In addition the supernovae. Assuming itself that cosmic rays are producena occurs every thousand that in every nebula one supernovic rays to be observed on years, the intensity of the cosmer σ = 2 × 10 3 erg / cm 2 sec σ = 2 × 10 3 erg / cm 2 sec sigma=2xx10^(-3)erg//cm^(2)sec\sigma=2 \times 10^{-3} \mathrm{erg} / \mathrm{cm}^{2} \mathrm{sec}σ=2×103erg/cm2sec. the earth should be of the order σ = 2 × 3 × 10 3 erg / cm 2 σ = 2 × 3 × 10 3 erg / cm 2 sigma=2xx3xx10^(-3)erg//cm^(2)\sigma=2 \times 3 \times 10^{-3} \mathrm{erg} / \mathrm{cm}^{2}σ=2×3×103erg/cm2 The observational values are With all reserve we advance the sec . (Millikan, Regener). With the transitions from view that supernovae represent which in their final stages ordinary stars into neursely packed neutrons. nova therefore emits during if supernovae initially are consist of extremely closely packed neutrons E τ 10 5 L τ = 3.78 × 10 53 ergs E τ 10 5 L τ = 3.78 × 10 53 ergs E_(tau) >= 10^(5)L_(tau)=3.78 xx10^(53)ergsE_{\tau} \geq 10^{5} L_{\tau}=3.78 \times 10^{53} \mathrm{ergs}Eτ105Lτ=3.78×1053ergs. If supernovae initially are

On Massive Neutron Cores

J. R. Oppenheimer and G. M. Volkoff
Department of Physics, University of California, Berkeley, California
(Received January 3, 1939)
It has been suggested that, when the pressure within stellar matter becomes high enough, a new phase consisting of neutrons will be formed. In this paper we study the gravitational equilibrium of masses of neutrons, using the equation of state for a cold Fermi gas, and general relativity. For masses under 1 3 1 3 (1)/(3)o.\frac{1}{3} \odot13 only one equilibrium solution exists, which is approximately described by the nonrelativistic Fermi equation of state and Newtonian gravitational theory. For masses 1 3 < m < 3 4 1 3 < m < 3 4 (1)/(3)o. < m < (3)/(4)o.\frac{1}{3} \odot<m<\frac{3}{4} \odot13<m<34 two solutions exist, one stable and quasi-Newtonian, one more condensed, and unstable. For masses greater than 3 4 3 4 (3)/(4)o.\frac{3}{4} \odot34 there are no static equilibrium solutions. These results are qualitatively confirmed by comparison with suitably chosen special cases of the analytic solutions recently discovered by Tolman. A discussion of the probable effect of deviations from the Fermi equation of state suggests that actual stellar matter after the exhaustion of thermonuclear sources of energy will, if massive enough, contract indefinitely, although more and more slowly, never reaching true equilibrium.
Figure 24.1.
Two important arrivals on the astrophysical scene: the neutron star (1933) and the black hole ( 1795 , 1939 ) ( 1795 , 1939 ) (1795,1939)(1795,1939)(1795,1939).
No proper account of either can forego general relativity.

DU MONDE,


\section*{EXPOSition
\section*{EXPOSition
,}
Par Pierre-Simon Laplace, de l'Institut National de France, et du Bureau des Longitudes.
TOME SECOND.
A PARIS,
De l'Imprimerie du Cercle-Social, rue du
Théàtre Français, N N N^(@)\mathrm{N}^{\circ}N. 4 .
s'an IV de la Rétubliquin Finggitse.

(305)

aussi sensibles à la distance qui nous etr separe ; ct combien ils doivent surpasser ceux que nous observons à la surface du soleil ? Tous ces corps devenus invisibles, sont à la même place où ils ont été observés, puisquiis n'en ont point changé, durant leur apparition; il existe donc dans les espaces célestes, des corps obscurs aussi considérables, et peut être en aussi grand nombre, que les. étoiles. Un astre lumineux de même densité que la terre, et dont le diamètre serait deux cents cinquante fois plus grand que celui du soleil, ne laisserait en vertu de son attraction, parvenir aucun de ses rayons jusqu"à nous; il est donc possible que les plus grands corps lumineux de l'univers, soient par cela même, invisibles. Une etoile qui, sans être de cette grandeur, surpasserait considerablement le soleil ; affaiblirait sensiblement la vitesse de la lumière, et augmenterait ainsi l'étendue de son aberration. Cette différence dans l'aberration des étoiles ; un catalogue de celles qui ne font que paraitre, et leur position observee au moment de leur éclat passager; la dè e termination de toutes les étoiles changeantes,
Tome IL v

On Continued Gravitational Contraction

J. R. Oppenheimer and H. SnyderUniversity of California, Berkeley, California

(Received July 10, 1939)
When all thermonuclear sources of energy are exhausted a sufficiently heavy star will collapse. Unless fission due to rotation, the radiation of mass, or the blowing off of mass by radiation, reduce the star's mass to the order of that of the sun, this contraction will continue indefinitely. In the present paper we study the solutions of the gravitational field equations which describe this process. In I, general and qualitative arguments are given on the behavior of the metrical tensor as the contraction progresses: the radius of the star approaches asymptotically its gravitational radius; light from the surface of the star is progressively reddened, and can escape over a progressively narrower range of angles. In II, an analytic solution of the field equations confirming these general arguments is obtained for the case that the pressure within the star can be neglected. The total time of collapse for an observer comoving with the stellar matter is finite, and for this idealized case and typical stellar masses, of the order of a day; an external observer sees the star asymptotically shrinking to its gravitational radius.
"Final state of stellar evolution," and "cold, catalyzed matter" defined
Equation of state for cold, catalyzed matter
What does one mean in principle by the term "the final state of stellar evolution"? Start with a star containing a given number, A A AAA, of baryons and let it evolve to the absolute, burned-out end point of thermonuclear combustion (minimum massenergy possible for the A A AAA-baryon system). If the normal course of thermonuclear combustion is too slow, speed it up by catalysis. If an explosion occurs, collect the outgoing matter, extract its kinetic energy, and let it fall back onto the system. Repeat this operation as many times as needed to arrive at burnout (cold Fe 56 Fe 56 Fe^(56)\mathrm{Fe}^{56}Fe56 for the part of the system under modest pressure; other nuclear species in the region closer to the center; "cold matter catalyzed to the end point of thermonuclear combustion" throughout). End up finally with the system in its absolutely lowest energy state, with all angular momentum removed and all heat extracted, so that it sits at the absolute zero of temperature and has zero angular velocity. Such a "dead" system, depending upon its mass and prior history (two distinct energy minima for certain A A AAA-values), ends up as a cold stellar configuration (neutron star, or "white" dwarf), or as a "dead" black hole.
The analysis of a cold stellar configuration demands an equation of state. The temperature is fixed at zero; the nuclear composition in principle is specified uniquely by the density; and therefore the pressure is also fixed uniquely once the density has been specified [equation of state p ( ρ ) p ( ρ ) p(rho)p(\rho)p(ρ) for "cold catalyzed matter"].
The white dwarfs and neutron stars observed by astronomers are not really built of cold catalyzed matter. However, the matter in them is sufficiently near the end point of thermonuclear evolution and sufficiently cold that it can be idealized with fair accuracy as cold and catalyzed (see §23.4).
The equation of state, ρ ( p ) ρ ( p ) rho(p)\rho(p)ρ(p), for cold catalyzed matter is shown graphically in Figure 24.2. This version of the equation of state was constructed by Harrison and Wheeler in 1958. Other versions constructed more recently [see Cameron (1970) and Baym, Bethe, and Pethick (1971) for references] are almost identical to the Harrison-Wheeler version at densities well below nuclear densities, ρ < 3 × 10 13 ρ < 3 × 10 13 rho < 3xx10^(13)\rho<3 \times 10^{13}ρ<3×1013 g / cm 3 g / cm 3 g//cm^(3)\mathrm{g} / \mathrm{cm}^{3}g/cm3. At nuclear and supernuclear densities, all versions differ because of differing assumptions about nucleon-nucleon interactions. Along with the equation of state, in Figure 24.2 are shown properties of the models of cold stars constructed from this equation of state by integrating numerically the equations of structure (23.28).
The equation of state can be understood by following the transformations that occur as a sample of cold catalyzed matter is compressed to higher and higher densities. At each stage in the compression, each possible thermonuclear reaction is to be catalyzed to its endpoint and the resultant thermal energy is to be removed.
When the sample is at zero pressure, it is a ball of pure, cold Fe 56 Fe 56 Fe^(56)\mathrm{Fe}^{56}Fe56, since Fe 56 Fe 56 Fe^(56)\mathrm{Fe}^{56}Fe56 is the most tightly bound of all nuclei. It has the density 7.86 g / cm 3 7.86 g / cm 3 7.86g//cm^(3)7.86 \mathrm{~g} / \mathrm{cm}^{3}7.86 g/cm3. As the sample is compressed, its internal pressure is provided at first by normal solid-state forces; but the atoms are soon squeezed so closely together that the electrons become quite oblivious of their nuclei, and begin to form a degenerate Fermi gas. By the time a density of ρ = 10 5 g / cm 3 ρ = 10 5 g / cm 3 rho=10^(5)g//cm^(3)\rho=10^{5} \mathrm{~g} / \mathrm{cm}^{3}ρ=105 g/cm3 has been reached, valence forces are completely negligible, the degenerate electron pressure dominates, and the compressibility index, γ γ gamma\gammaγ (see legend for Figure 24.2), is 5 / 3 5 / 3 5//35 / 35/3, the value for a nonrelativistically degenerate Fermi gas. Between 10 5 10 5 10^(5)10^{5}105 and 10 7 g / cm 3 10 7 g / cm 3 10^(7)g//cm^(3)10^{7} \mathrm{~g} / \mathrm{cm}^{3}107 g/cm3, the pressure-providing electrons gradually
Equation of state
Stellar models
Figure 24.2.
The Harrison-Wheeler equation of state for cold matter at the absolute end point of thermonuclear evolution, and the corresponding Harrison-Wakano-Wheeler stellar models. The equation of state is exhibited in the form of a plot of "compressibility index,"
γ = ρ + p p d p d ρ γ = ρ + p p d p d ρ gamma=(rho+p)/(p)(dp)/(d rho)\gamma=\frac{\rho+p}{p} \frac{d p}{d \rho}γ=ρ+ppdpdρ
as a function of density of mass-energy, ρ ρ rho\rhoρ. (Small γ γ gamma\gammaγ corresponds to easy compressibility.) The curve is parameterized by the logarithm of the pressure, log 10 p log 10 p log_(10)p\log _{10} plog10p, in units of g / cm 3 g / cm 3 g//cm^(3)\mathrm{g} / \mathrm{cm}^{3}g/cm3 [same units as ρ ρ rho\rhoρ; note that p ( g / cm 3 ) = ( 1 / c 2 ) × p ( dyne / cm 2 ) ] p g / cm 3 = 1 / c 2 × p dyne / cm 2 {:p((g)//cm^(3))=(1//c^(2))xx p(dyne//cm^(2))]\left.p\left(\mathrm{~g} / \mathrm{cm}^{3}\right)=\left(1 / c^{2}\right) \times p\left(\mathrm{dyne} / \mathrm{cm}^{2}\right)\right]p( g/cm3)=(1/c2)×p(dyne/cm2)]. The chemical composition of the matter as a function of density is indicated as follows: Fe , Fe 56 Fe , Fe 56 Fe,Fe^(56)\mathrm{Fe}, \mathrm{Fe}^{56}Fe,Fe56 nuclei; A , nuclei more neutron rich than Fe 56 Fe 56 Fe^(56)\mathrm{Fe}^{56}Fe56; e, electrons; n, free neutrons; p, free protons.
The first law of thermodynamics [equation (22.6)], when applied to cold matter (zero entropy) says d ρ / ( ρ + p ) = d n / n d ρ / ( ρ + p ) = d n / n d rho//(rho+p)=dn//nd \rho /(\rho+p)=d n / ndρ/(ρ+p)=dn/n; i.e.,
n = ρ + p μ Fe / 56 exp ( 0 p d p ρ + p ) n = ρ + p μ Fe / 56 exp 0 p d p ρ + p n=(rho+p)/(mu_(Fe)//56)exp(-int_(0)^(p)(dp)/(rho+p))n=\frac{\rho+p}{\mu_{\mathrm{Fe}} / 56} \exp \left(-\int_{0}^{p} \frac{d p}{\rho+p}\right)n=ρ+pμFe/56exp(0pdpρ+p)
Here μ Fe μ Fe mu_(Fe)\mu_{\mathrm{Fe}}μFe, the rest mass of an Fe 56 Fe 56 Fe^(56)\mathrm{Fe}^{56}Fe56 atom, is the ratio between ρ + p ρ ρ + p ρ rho+p~~rho\rho+p \approx \rhoρ+pρ and n / 56 n / 56 n//56n / 56n/56 in the limit of zero density. From this equation and a knowledge of ρ ( p ) ρ ( p ) rho(p)\rho(p)ρ(p)-(see Figure)-one can calculate n ( p ) n ( p ) n(p)n(p)n(p).
The equilibrium configurations are represented by curves of total mass-energy, M M MMM, versus radius, R R RRR. ( R R RRR is defined such that 4 π R 2 4 π R 2 4piR^(2)4 \pi R^{2}4πR2 is the star's surface area.) The M ( R ) M ( R ) M(R)M(R)M(R) curve is parameterized by the logarithm of the central density, log 10 ρ c log 10 ρ c log_(10)rho_(c)\log _{10} \rho_{c}log10ρc, measured in g / cm 3 g / cm 3 g//cm^(3)\mathrm{g} / \mathrm{cm}^{3}g/cm3. Only configurations along two branches of the curve are stable against small perturbations and can therefore exist in nature: the white dwarfs, with log 10 ρ c < 8.38 log 10 ρ c < 8.38 log_(10)rho_(c) < 8.38\log _{10} \rho_{c}<8.38log10ρc<8.38, and the neutron stars, with 13.43 < log 10 ρ c < 15.78 13.43 < log 10 ρ c < 15.78 13.43 < log_(10)rho_(c) < 15.7813.43<\log _{10} \rho_{c}<15.7813.43<log10ρc<15.78 (see Box 26.1).
For greater detail on both the equation of state and the equilibrium configurations, see Harrison, Thorne, Wakano, and Wheeler (1965); also, for an updated table of the equation of state, see Hartle and Thorne (1968).
become relativistically degenerate, and γ γ gamma\gammaγ approaches 4 / 3 4 / 3 4//34 / 34/3. Above ρ = 1.4 × 10 7 ρ = 1.4 × 10 7 rho=1.4 xx10^(7)\rho=1.4 \times 10^{7}ρ=1.4×107 g / cm 3 g / cm 3 g//cm^(3)\mathrm{g} / \mathrm{cm}^{3}g/cm3, the rest mass of 62 Fe 26 56 62 Fe 26 56 62Fe_(26)^(56)62 \mathrm{Fe}_{26}^{56}62Fe2656 nuclei, plus the rest mass of 44 electrons, plus the rather large Fermi kinetic energy of 44 electrons at the top of the Fermi sea, exceeds the rest mass of 56 Ni 28 62 56 Ni 28 62 56Ni_(28)^(62)56 \mathrm{Ni}_{28}^{62}56Ni2862 nuclei. Consequently, as the catalyzed sample of matter is compressed past ρ = 1.4 × 10 7 g / cm 3 ρ = 1.4 × 10 7 g / cm 3 rho=1.4 xx10^(7)g//cm^(3)\rho=1.4 \times 10^{7} \mathrm{~g} / \mathrm{cm}^{3}ρ=1.4×107 g/cm3, the nuclear reaction
(24.1) 62 Fe 26 56 ( highly compressed neutral atoms ) 56 Ni 28 62 ( highly compressed neutral atoms ) (24.1) 62 Fe 26 56 (  highly compressed   neutral atoms  ) 56 Ni 28 62 (  highly compressed   neutral atoms  ) {:[(24.1)62Fe_(26)^(56)((" highly compressed ")/(" neutral atoms "))longrightarrow],[56Ni_(28)^(62)((" highly compressed ")/(" neutral atoms "))]:}\begin{align*} & 62 \mathrm{Fe}_{26}^{56}\binom{\text { highly compressed }}{\text { neutral atoms }} \longrightarrow \tag{24.1}\\ & 56 \mathrm{Ni}_{28}^{62}\binom{\text { highly compressed }}{\text { neutral atoms }} \end{align*}(24.1)62Fe2656( highly compressed  neutral atoms )56Ni2862( highly compressed  neutral atoms )
goes to its end point, with a release of energy. As the compression continues beyond this point, the rising Fermi energy of the electrons induces new nuclear reactions similar to (24.1), but involving different nuclei. In these reactions more and more electrons are swallowed up to form new nuclei, which are more and more neutronrich. When the density reaches ρ = 3 × 10 11 g / cm 3 ρ = 3 × 10 11 g / cm 3 rho=3xx10^(11)g//cm^(3)\rho=3 \times 10^{11} \mathrm{~g} / \mathrm{cm}^{3}ρ=3×1011 g/cm3, the nuclei are so highly neutronrich ( Y 39 122 ) Y 39 122 (Y_(39)^(122))\left(\mathrm{Y}_{39}^{122}\right)(Y39122) that neutrons begin to drip off them. The matter now becomes highly compressible for a short time ( 3 × 10 11 ρ 4 × 10 11 ) 3 × 10 11 ρ 4 × 10 11 (3xx10^(11) <= rho <= 4xx10^(11))\left(3 \times 10^{11} \leqq \rho \leqq 4 \times 10^{11}\right)(3×1011ρ4×1011), since most of the remaining electrons are swallowed up very rapidly by the dripping nuclei. Above ρ 4 × 10 11 g / cm 3 ρ 4 × 10 11 g / cm 3 rho∼4xx10^(11)g//cm^(3)\rho \sim 4 \times 10^{11} \mathrm{~g} / \mathrm{cm}^{3}ρ4×1011 g/cm3 free neutrons become plentiful and their degeneracy pressure exceeds that of the electrons. Further compression to ρ 10 13 g / cm 3 ρ 10 13 g / cm 3 rho∼10^(13)g//cm^(3)\rho \sim 10^{13} \mathrm{~g} / \mathrm{cm}^{3}ρ1013 g/cm3 completely disintegrates the remaining nuclei, leaving the sample almost pure neutrons with γ = 5 / 3 γ = 5 / 3 gamma=5//3\gamma=5 / 3γ=5/3, the value for a nonrelativistically degenerate Fermi gas. Intermixed with the neutrons are just enough degenerate electrons to prevent the neutrons from decaying, and just enough protons to maintain charge neutrality. Compression beyond ρ 10 13 g / cm 3 ρ 10 13 g / cm 3 rho∼10^(13)g//cm^(3)\rho \sim 10^{13} \mathrm{~g} / \mathrm{cm}^{3}ρ1013 g/cm3 pushes the sample into the domain of nuclear densities where the physics of matter is only poorly understood. This Harrison-Wheeler version of the equation of state ignores all nucleon-nucleon interactions at abd above nuclear densities; it idealizes matter as a noninteracting mixture of neutrons, protons, and electrons with neutrons dominating; and it shows a compressibility index of 5 / 3 5 / 3 5//35 / 35/3 while the neutrons are nonrelativistic, but 4 / 3 4 / 3 4//34 / 34/3 after they attain relativistic Fermi energies. Other versions of the equation of state attempt to take into account the nucleonnucleon interactions in a variety of ways [see Cameron (1970), Baym, Bethe, and Pethick (1971), and many references cited therein].
Corresponding to each value of the central density, ρ c ρ c rho_(c)\rho_{c}ρc, there is one stellar equilibrium configuration. Equilibrium, yes; but is the equilibrium stable? Stability studies (Chapter 26, especially Box 26.1 ) show that many of the models are unstable against small radial perturbations, which lead to gravitational collapse. Only white-dwarf stars in the range log 10 ρ c < 8.4 log 10 ρ c < 8.4 log_(10)rho_(c) < 8.4\log _{10} \rho_{c}<8.4log10ρc<8.4 and neutron stars in the range 13.4 log 10 ρ c 15.8 13.4 log 10 ρ c 15.8 13.4 <= log_(10)rho_(c) <= 15.813.4 \leqq \log _{10} \rho_{c} \leqq 15.813.4log10ρc15.8 are stable. Instability for the region of log 10 ρ c log 10 ρ c log_(10)rho_(c)\log _{10} \rho_{c}log10ρc values between 8.4 and 13.4 is caused by a combination of (1) relativistic strengthening of the gravitational forces, and (2) high compressibility of the matter due to electron capture and neutron drip by
Equilibrium configurations for cold, catalyzed matter:
(1) forms and stability
the atomic nuclei. Neutron stars are stable for a simple reason. Neutron-dominated matter is so difficult to compress that even the relativistically strengthened gravitational forces cannot overcome it. Above log 10 ρ c 15.8 log 10 ρ c 15.8 log_(10)rho_(c)∼15.8\log _{10} \rho_{c} \sim 15.8log10ρc15.8, the gravitational forces become strong enough to win out over the pressure of the nuclear matter, and the stars ate all unstable. [See Gerlach (1968) for the possibility-which, however, he rates as unlikely-that there might exist a third family of stable equilibrium configurations, additional to white dwarfs and neutron stars.]
The white-dwarf stars have masses below 1.2 M 1.2 M 1.2M_(o.)1.2 M_{\odot}1.2M and radii between 3000 3000 ∼3000\sim 30003000 and 20 , 000 km 20 , 000 km ∼20,000km\sim 20,000 \mathrm{~km}20,000 km. They are supported almost entirely by the pressure of the degenerate electron gas. Relativistic deviations from Newtonian structure are only a fraction of a per cent, but relativistic effects on stability and pulsations are important from ρ c 10 8 g / cm 3 ρ c 10 8 g / cm 3 rho_(c)~~10^(8)g//cm^(3)\rho_{c} \approx 10^{8} \mathrm{~g} / \mathrm{cm}^{3}ρc108 g/cm3 to the upper limit of the white-dwarf family at ρ c = 10 8.4 g / cm 3 [ see ρ c = 10 8.4 g / cm 3 [ see rho_(c)=10^(8.4)g//cm^(3)[see\rho_{c}=10^{8.4} \mathrm{~g} / \mathrm{cm}^{3}[\mathrm{see}ρc=108.4 g/cm3[see, e.g., Faulkner and Gribbin (1968)]. The properties of white-dwarf models are fairly independent of whose version of the equation of state is used in the calculations.
The properties of neutron stars are moderately dependent on the equation of state used. However, all versions lead to upper and lower limits on the mass and central density. The correct lower limits probably lie in the range
(24.2) 13.4 log 10 ρ c min 14.0 0.05 M M min 0.2 M (24.2) 13.4 log 10 ρ c min 14.0 0.05 M M min 0.2 M {:[(24.2)13.4 <= log_(10)rho_(c min) <= 14.0],[0.05M_(o.) <= M_(min) <= 0.2M_(o.)]:}\begin{align*} & 13.4 \leqq \log _{10} \rho_{c \min } \leq 14.0 \tag{24.2}\\ & 0.05 M_{\odot} \leq M_{\min } \leq 0.2 M_{\odot} \end{align*}(24.2)13.4log10ρcmin14.00.05MMmin0.2M
the correct upper limits are probably in the range
(24.3) 15.0 log 10 ρ c max 16.0 , 0.5 M M max 3 M (24.3) 15.0 log 10 ρ c max 16.0 , 0.5 M M max 3 M {:[(24.3)15.0≲log_(10)rho_(c max)≲16.0","],[0.5M_(o.) <= M_(max)≲3M_(o.)]:}\begin{align*} & 15.0 \lesssim \log _{10} \rho_{c \max } \lesssim 16.0, \tag{24.3}\\ & 0.5 M_{\odot} \leqq M_{\max } \lesssim 3 M_{\odot} \end{align*}(24.3)15.0log10ρcmax16.0,0.5MMmax3M
[see Rhoades (1971)]. Neutron stars typically have radii between 6 km 6 km ∼6km\sim 6 \mathrm{~km}6 km and 100 100 ∼100\sim 100100 km . Relativistic deviations from Newtonian structure are great, sometimes more than 50 per cent.
It appears certain that no cold stellar configuration can have a mass exceeding 5 M [ 5 M ∼5M_(o.)[:}\sim 5 M_{\odot}\left[\right.5M[ Rhoades (1971)] (1.2 M M M_(o.)M_{\odot}M according to the Harrison-Wheeler equation of state, Figure 24.2). Any star more massive than this must reduce its mass below this limit if it is to fade away into quiet obscurity, otherwise relativistic gravitational forces will eventually pull it into catastrophic gravitational collapse past white-dwarf radii, past neutron-star radii, and into a black hole a few kilometers in size (see Part VII).

§24.3. PULSARS

Theory predicts that, when a star more massive than the Chandrasekhar limit of 1.2 M 1.2 M 1.2M_(o.)1.2 M_{\odot}1.2M has exhausted the nuclear fuel in its core and has compressed its core to white-dwarf densities, an instability pushes the star into catastrophic collapse. The
Birth of a neutron star by stellar collapse
Dynamics of a newborn neutron star
Neutron star as a pulsar
Pulsar radiation as a tool for studying neutron stars
core implodes upon itself until nucleon-nucleon repulsion halts the implosion. The result is a neutron star, unless the core's mass is so great that gravity overcomes the nucleon-nucleon repulsion and pulls the star on in to form a black hole. Not all the star's mass should become part of the neutron star or black hole. Much of it, perhaps most, can be ejected into interstellar space by the violence that accompanies the collapse-violence due to flash nuclear burning, shock waves, and energy transport by neutrinos ("stick of dynamite in center of star, ignited by collapse").
The collapsed core holds more interest for gravitation theory than the ejected envelope. That core, granted a mass small enough to avoid the black-hole fate, will initially be a hot, wildly pulsating, rapidly rotating glob of nuclear matter with a strong, embedded magnetic field (see Figure 24.3). The pulsations must die out quickly. They emit a huge flux of gravitational radiation, and radiation reaction damps them in a characteristic time of 1 1 ∼1\sim 11 second [see Wheeler (1966); Thorne (1969a)]. Moreover, the pulsations push and pull elementary particle reactions back and forth by raising and lowering the Fermi energies in the core's interior; these particle reactions can convert pulsation energy into heat at about the same rate as the pulsation energy is radiated by gravity. [See Langer and Cameron (1969); also §11.5 of Zel'dovich and Novikov (1971) for details and references.]
The result, after a few seconds, is a rapidly rotating centrifugally flattened neutron star with a strong (perhaps 10 12 10 12 10^(12)10^{12}1012 gauss) magnetic field; all the pulsations are gone. If the star is deformed from axial symmetry (e.g., by centrifugal forces or by a nonsymmetric magnetic field), its rotation produces a steady outgoing stream of gravitational waves, which act back on the star to remove rotational energy. Whether or not this occurs, the rotating magnetic field itself radiates electromagnetic waves. They slow the rotation and transport energy into the surrounding, exploding gas cloud (nebula). [See Pacini (1968), Goldreich and Julian (1968), and Ostriker and Gunn (1969) for basic considerations.]
Somehow, but nobody understands in detail how, the rotating neutron star beams coherent radio waves and light out into space. Each time the beam sweeps past the Earth optical and radio telescopes see a pulse of radiation. The light is emitted synchronously with the radio waves, but the light pulses reach Earth earlier ( 1 1 ∼1\sim 11 second for the pulsar in the crab nebula) because of the retardation of the radio waves by the plasma along the way. This is the essence of the 1973 theory of pulsars, accepted by most astrophysicists.
Although the mechanism of coherent emission is not understood, the pulsar radiation can nevertheless be a powerful tool in the experimental study of neutron stars. Anything that affects the stellar rotation rate, even minutely (fractional changes as small as 10 9 10 9 10^(-9)10^{-9}109 ) will produce measurable irregularities in the timing of the pulses at Earth. If the star's crust and mantle are crystalline, as 1973 theory predicts, they may be subject to cracking, faulting, or slippage ("starquake") that changes the moment of inertia, and thence the rotation rate. Debris falling into the star will also change its rotation. Whichever the cause, after such a disturbance the star may rotate differentially for awhile; and how it returns to rigid rotation may depend on such phenomena as superfluidity in its deep interior. Thus, pulsar-timing data may eventually give information about the interior and crust of the neutron star, and
Figure 24.3.
"Collapse, pursuit, and plunge scenario" [schematic from Ruffini and Wheeler (1971b)].
  • A star with white-dwarf core (A), slowly rotating,
  • evolves by straightforward astrophysics,
  • arrives at the point of gravitational instability,
  • collapses, and
  • ends up as a rapidly spinning neutron-star pancake ( B , B B , B B,B^(')\mathrm{B}, \mathrm{B}^{\prime}B,B ).
  • It then fragments (C) because it has too much angular momentum to collapse into a single stable object. If the substance of the neutron-star pancake were an incompressible fluid, the fragmentation would have a close tie to well-known and often observed phenomena ("drop formation"). However, the more massive a neutron star is, the smaller it is, so one's insight into this and subsequent stages of the scenario are of necessity subject to correction or amendment. One can not today guarantee that fragmentation takes place at all; nevertheless, fragmentation will be assumed in what follows.
  • The fragments dissipate energy and angular momentum via gravitational radiation.
  • One by one as they revolve they coalesce ("pursuit and plunge scenario").
  • In each such plunge a pulse of gravitational radiation emerges.
  • Fragments of debris fall onto the coalesced objects (neutron stars or black holes, as the case may be), changing their angular momenta.
  • Eventually the distinct neutron stars or black holes or both unite into one such collapsed object with a final pulse of gravitational radiation.
  • The details of the complete scenario differ completely from one evolving star to another, depending on
  • the mass of its core, and
  • the angular momentum of this core.
  • An entirely different kind of picture therefore has to be drawn for altered values of these two parameters.
  • Even for the values of these parameters adopted in the drawing, the present picture can at best possess only qualitative validity.
  • Detailed computer analysis would seem essential for any firm prediction about the course of any selected scenario.
    thence (by combination with theory) about its mass and radius. These issues are discussed in detail in a review article by Ruderman (1972) as well as in Zel'dovich and Novikov (1971).

§24.4. SUPERMASSIVE STARS AND STELLAR INSTABILITIES

Theory of the stability of Newtonian stars
When a Newtonian star of mass M M MMM oscillates adiabatically in its fundamental mode, the change in its radius, δ R δ R delta R\delta RδR, obeys a harmonic-oscillator equation,
(24.4) M δ R ¨ = k δ R , (24.4) M δ R ¨ = k δ R , {:(24.4)M deltaR^(¨)=-k delta R",":}\begin{equation*} M \delta \ddot{R}=-k \delta R, \tag{24.4} \end{equation*}(24.4)MδR¨=kδR,
with a "spring constant" k k kkk that depends on the star's mean adiabatic index Γ ¯ 1 Γ ¯ 1 bar(Gamma)_(1)\bar{\Gamma}_{1}Γ¯1 [recall: Γ 1 ( n / p ) ( p / n ) const. entropy Γ 1 ( n / p ) ( p / n ) const. entropy  Gamma_(1)-=(n//p)(del p//del n)_("const. entropy ")\Gamma_{1} \equiv(n / p)(\partial p / \partial n)_{\text {const. entropy }}Γ1(n/p)(p/n)const. entropy  ], on its gravitational potential energy Ω Ω Omega\OmegaΩ, on the trace I = ρ r 2 d V I = ρ r 2 d V I=int rhor^(2)dVI=\int \rho r^{2} d \mathscr{V}I=ρr2dV of the second moment of its mass distribution, and on its mass M M MMM,
(24.5) k = 3 M ( Γ ¯ 1 4 / 3 ) | Ω | / I (24.5) k = 3 M Γ ¯ 1 4 / 3 | Ω | / I {:(24.5)k=3M( bar(Gamma)_(1)-4//3)|Omega|//I:}\begin{equation*} k=3 M\left(\bar{\Gamma}_{1}-4 / 3\right)|\Omega| / I \tag{24.5} \end{equation*}(24.5)k=3M(Γ¯14/3)|Ω|/I
(See Box 24.2). If Γ ¯ 1 > 4 / 3 Γ ¯ 1 > 4 / 3 bar(Gamma)_(1) > 4//3\bar{\Gamma}_{1}>4 / 3Γ¯1>4/3 the Newtonian star is stable and oscillates; if Γ ¯ 1 < 4 / 3 Γ ¯ 1 < 4 / 3 bar(Gamma)_(1) < 4//3\bar{\Gamma}_{1}<4 / 3Γ¯1<4/3 the star is unstable and either collapses or explodes, depending on its initial conditions and overall energetics. This result is a famous theorem in Newtonian stellar theory-but it is relevant only for adiabatic oscillations.

Box 24.2 OSCILLATION OF A NEWTONIAN STAR

The following is a volume-averaged analysis of the lowest mode of radial oscillation. Such analyses are useful in understanding the qualitative behavior and stability of a star. [See Zel'dovich and Novikov (1971) for an extensive exploitation of them.] However, for precise quantitative results, one must perform a more detailed analysis [see, e.g., Ledoux and Walraven (1958); also Chapter 26 of this book].
  1. Let M = M = M=M=M= star's total mass
    R = R = R=R=R= star's radius
    ρ ¯ = ρ ¯ = bar(rho)=\bar{\rho}=ρ¯= mean density = ( 3 / 4 π ) M / R 3 = ( 3 / 4 π ) M / R 3 =(3//4pi)M//R^(3)=(3 / 4 \pi) M / R^{3}=(3/4π)M/R3
    p ¯ = p ¯ = bar(p)=\bar{p}=p¯= mean pressure
    Γ ¯ 1 = Γ ¯ 1 = bar(Gamma)_(1)=\bar{\Gamma}_{1}=Γ¯1= mean adiabatic index = ( n ¯ / p ¯ ) ( p ¯ / n ¯ ) adiabatic = ( n ¯ / p ¯ ) ( p ¯ / n ¯ ) adiabatic  =( bar(n)// bar(p))(del bar(p)//del bar(n))_("adiabatic ")=(\bar{n} / \bar{p})(\partial \bar{p} / \partial \bar{n})_{\text {adiabatic }}=(n¯/p¯)(p¯/n¯)adiabatic 
    = ( ρ ¯ / p ¯ ) ( p ¯ / ρ ¯ ) adiabatic = ( ρ ¯ / p ¯ ) ( p ¯ / ρ ¯ ) adiabatic  =( bar(rho)// bar(p))(del bar(p)//del bar(rho))_("adiabatic ")=(\bar{\rho} / \bar{p})(\partial \bar{p} / \partial \bar{\rho})_{\text {adiabatic }}=(ρ¯/p¯)(p¯/ρ¯)adiabatic  in Newtonian limit, where ρ = ρ = rho=\rho=ρ= const. × n × n xx n\times n×n.
  2. Then the mean pressure-buoyancy force F ¯ buoy F ¯ buoy  bar(F)_("buoy ")\bar{F}_{\text {buoy }}F¯buoy  and the counterbalancing gravitational force F ¯ grav F ¯ grav  bar(F)_("grav ")\bar{F}_{\text {grav }}F¯grav  in the equilibrium star are
F ¯ buoy = p ¯ / R = F ¯ grav = ρ ¯ M / R 2 = ( 4 π / 3 ) ρ ¯ 2 R . F ¯ buoy  = p ¯ / R = F ¯ grav  = ρ ¯ M / R 2 = ( 4 π / 3 ) ρ ¯ 2 R . {:[ bar(F)_("buoy ")= bar(p)//R],[= bar(F)_("grav ")= bar(rho)M//R^(2)=(4pi//3) bar(rho)^(2)R.]:}\begin{aligned} \bar{F}_{\text {buoy }} & =\bar{p} / R \\ & =\bar{F}_{\text {grav }}=\bar{\rho} M / R^{2}=(4 \pi / 3) \bar{\rho}^{2} R . \end{aligned}F¯buoy =p¯/R=F¯grav =ρ¯M/R2=(4π/3)ρ¯2R.
  1. When the oscillating star has expanded or contracted so its radius is R + δ R R + δ R R+delta RR+\delta RR+δR, then its mean density will have changed to
ρ ¯ + δ ρ ¯ = ( 3 / 4 π ) M [ R 3 + δ ( R 3 ) ] = ρ ¯ 3 ( ρ ¯ / R ) δ R , ρ ¯ + δ ρ ¯ = ( 3 / 4 π ) M R 3 + δ R 3 = ρ ¯ 3 ( ρ ¯ / R ) δ R , bar(rho)+delta bar(rho)=(3//4pi)M[R^(-3)+delta(R^(-3))]= bar(rho)-3( bar(rho)//R)delta R,\bar{\rho}+\delta \bar{\rho}=(3 / 4 \pi) M\left[R^{-3}+\delta\left(R^{-3}\right)\right]=\bar{\rho}-3(\bar{\rho} / R) \delta R,ρ¯+δρ¯=(3/4π)M[R3+δ(R3)]=ρ¯3(ρ¯/R)δR,
and its mean pressure will be
p ¯ + δ p ¯ = p ¯ + ( p ¯ / ρ ¯ ) Γ ¯ 1 δ ρ ¯ = p ¯ 3 ( Γ ¯ 1 p ¯ / R ) δ R . p ¯ + δ p ¯ = p ¯ + ( p ¯ / ρ ¯ ) Γ ¯ 1 δ ρ ¯ = p ¯ 3 Γ ¯ 1 p ¯ / R δ R . bar(p)+delta bar(p)= bar(p)+( bar(p)// bar(rho)) bar(Gamma)_(1)delta bar(rho)= bar(p)-3( bar(Gamma)_(1)( bar(p))//R)delta R.\bar{p}+\delta \bar{p}=\bar{p}+(\bar{p} / \bar{\rho}) \bar{\Gamma}_{1} \delta \bar{\rho}=\bar{p}-3\left(\bar{\Gamma}_{1} \bar{p} / R\right) \delta R .p¯+δp¯=p¯+(p¯/ρ¯)Γ¯1δρ¯=p¯3(Γ¯1p¯/R)δR.
The corresponding changes in the forces will be
δ F ¯ buoy = δ p ¯ R p ¯ R 2 δ R = ( 3 Γ 1 + 1 ) p ¯ R δ R R = ( 3 Γ ¯ 1 + 1 ) F ¯ buoy ( δ R R ) δ F ¯ grav = ( 4 π 3 ) ( 2 ρ ¯ R δ ρ ¯ + ρ ¯ 2 δ R ) = ( 4 π 3 ρ ¯ 2 R ) ( 5 δ R R ) = 5 F ¯ grav ( δ R R ) δ F ¯ buoy  = δ p ¯ R p ¯ R 2 δ R = 3 Γ 1 + 1 p ¯ R δ R R = 3 Γ ¯ 1 + 1 F ¯ buoy  δ R R δ F ¯ grav  = 4 π 3 2 ρ ¯ R δ ρ ¯ + ρ ¯ 2 δ R = 4 π 3 ρ ¯ 2 R 5 δ R R = 5 F ¯ grav  δ R R {:[delta bar(F)_("buoy ")=(delta( bar(p)))/(R)-(( bar(p)))/(R^(2))delta R=-(3Gamma_(1)+1)(( bar(p)))/(R)(delta R)/(R)=-(3 bar(Gamma)_(1)+1) bar(F)_("buoy ")((delta R)/(R))],[delta bar(F)_("grav ")=((4pi)/(3))(2( bar(rho))R delta( bar(rho))+ bar(rho)^(2)delta R)=((4pi)/(3) bar(rho)^(2)R)(-5(delta R)/(R))=-5 bar(F)_("grav ")((delta R)/(R))]:}\begin{aligned} & \delta \bar{F}_{\text {buoy }}=\frac{\delta \bar{p}}{R}-\frac{\bar{p}}{R^{2}} \delta R=-\left(3 \Gamma_{1}+1\right) \frac{\bar{p}}{R} \frac{\delta R}{R}=-\left(3 \bar{\Gamma}_{1}+1\right) \bar{F}_{\text {buoy }}\left(\frac{\delta R}{R}\right) \\ & \delta \bar{F}_{\text {grav }}=\left(\frac{4 \pi}{3}\right)\left(2 \bar{\rho} R \delta \bar{\rho}+\bar{\rho}^{2} \delta R\right)=\left(\frac{4 \pi}{3} \bar{\rho}^{2} R\right)\left(-5 \frac{\delta R}{R}\right)=-5 \bar{F}_{\text {grav }}\left(\frac{\delta R}{R}\right) \end{aligned}δF¯buoy =δp¯Rp¯R2δR=(3Γ1+1)p¯RδRR=(3Γ¯1+1)F¯buoy (δRR)δF¯grav =(4π3)(2ρ¯Rδρ¯+ρ¯2δR)=(4π3ρ¯2R)(5δRR)=5F¯grav (δRR)
Consequently, the restoring force will be (recall: F ¯ buoy = F ¯ grav F ¯ buoy  = F ¯ grav  bar(F)_("buoy ")= bar(F)_("grav ")\bar{F}_{\text {buoy }}=\bar{F}_{\text {grav }}F¯buoy =F¯grav  )
δ F ¯ grav δ F ¯ buoy = 3 ( Γ ¯ 1 4 3 ) F ¯ grav δ R R . δ F ¯ grav  δ F ¯ buoy  = 3 Γ ¯ 1 4 3 F ¯ grav  δ R R . delta bar(F)_("grav ")-delta bar(F)_("buoy ")=3( bar(Gamma)_(1)-(4)/(3)) bar(F)_("grav ")(delta R)/(R).\delta \bar{F}_{\text {grav }}-\delta \bar{F}_{\text {buoy }}=3\left(\bar{\Gamma}_{1}-\frac{4}{3}\right) \bar{F}_{\text {grav }} \frac{\delta R}{R} .δF¯grav δF¯buoy =3(Γ¯143)F¯grav δRR.
  1. This restoring force produces an acceleration,
δ F ¯ grav δ F ¯ buoy = ρ ¯ δ R ¨ δ F ¯ grav  δ F ¯ buoy  = ρ ¯ δ R ¨ delta bar(F)_("grav ")-delta bar(F)_("buoy ")=- bar(rho)deltaR^(¨)\delta \bar{F}_{\text {grav }}-\delta \bar{F}_{\text {buoy }}=-\bar{\rho} \delta \ddot{R}δF¯grav δF¯buoy =ρ¯δR¨
Hence, the equation of motion for the oscillations is
δ R ¨ = 3 ( Γ ¯ 1 4 / 3 ) ( 4 π / 3 ) ρ ¯ δ R , δ R ¨ = 3 Γ ¯ 1 4 / 3 ( 4 π / 3 ) ρ ¯ δ R , deltaR^(¨)=-3( bar(Gamma)_(1)-4//3)(4pi//3) bar(rho)delta R,\delta \ddot{R}=-3\left(\bar{\Gamma}_{1}-4 / 3\right)(4 \pi / 3) \bar{\rho} \delta R,δR¨=3(Γ¯14/3)(4π/3)ρ¯δR,
corresponding to a "spring constant" k k kkk and angular frequency of oscillation ω ω omega\omegaω, given by ω 2 = 4 π ( Γ ¯ 1 4 / 3 ) ρ ¯ ω 2 = 4 π Γ ¯ 1 4 / 3 ρ ¯ omega^(2)=4pi( bar(Gamma)_(1)-4//3) bar(rho)\omega^{2}=4 \pi\left(\bar{\Gamma}_{1}-4 / 3\right) \bar{\rho}ω2=4π(Γ¯14/3)ρ¯, and k = M ω 2 k = M ω 2 k=Momega^(2)k=M \omega^{2}k=Mω2.
5. A more nearly exact analysis (see exercise 39.7 for details, or Box 26.2 for an alternative derivation) yields the improved formula
ω 2 = 3 ( Γ ¯ 1 4 / 3 ) | Ω | / I , Ω = ( star's self-gravitational energy ) = 1 2 ρ Φ d V = 1 2 ρ ρ | x x | d V d V , I = ( trace of second moment of star's mass distribution ) = ρ r 2 d V , ω 2 = 3 Γ ¯ 1 4 / 3 | Ω | / I , Ω = (  star's self-gravitational   energy  ) = 1 2 ρ Φ d V = 1 2 ρ ρ x x d V d V , I = (  trace of second moment of   star's mass distribution  ) = ρ r 2 d V , {:[omega^(2)=3( bar(Gamma)_(1)-4//3)|Omega|//I","],[Omega=((" star's self-gravitational ")/(" energy "))=(1)/(2)int rho Phi dV=-(1)/(2)int(rhorho^('))/(|x-x^(')|)dVdV^(')","],[I=((" trace of second moment of ")/(" star's mass distribution "))=int rhor^(2)dV","]:}\begin{gathered} \omega^{2}=3\left(\bar{\Gamma}_{1}-4 / 3\right)|\Omega| / I, \\ \Omega=\binom{\text { star's self-gravitational }}{\text { energy }}=\frac{1}{2} \int \rho \Phi d \mathscr{V}=-\frac{1}{2} \int \frac{\rho \rho^{\prime}}{\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|} d \mathscr{V} d V^{\prime}, \\ I=\binom{\text { trace of second moment of }}{\text { star's mass distribution }}=\int \rho r^{2} d \mathscr{V}, \end{gathered}ω2=3(Γ¯14/3)|Ω|/I,Ω=( star's self-gravitational  energy )=12ρΦdV=12ρρ|xx|dVdV,I=( trace of second moment of  star's mass distribution )=ρr2dV,
for the square of the oscillation frequency.
6. Note that Γ ¯ 1 > 4 / 3 Γ ¯ 1 > 4 / 3 bar(Gamma)_(1) > 4//3\bar{\Gamma}_{1}>4 / 3Γ¯1>4/3 corresponds to stable oscillations; Γ ¯ 1 < 4 / 3 Γ ¯ 1 < 4 / 3 bar(Gamma)_(1) < 4//3\bar{\Gamma}_{1}<4 / 3Γ¯1<4/3 corresponds to exponentially developing collapse or explosion.
Stability theory predicts
"engine-driven oscillations" and quick death for stars of M > 60 M M > 60 M M > 60M_(o.)M>60 M_{\odot}M>60M
Possible existence of supermassive stars
Relativistic instabilities in a supermassive star
In a real star no oscillation is precisely adiabatic. The oscillations in temperature cause corresponding oscillations in the stellar opacity and in nuclear burning rates. These insert energy into or extract energy from the gas vibrations.
All main-sequence stars thus far observed and studied have masses below 60 M 60 M 60M_(o.)60 M_{\odot}60M. For such small masses, theory predicts low enough temperatures that gas pressure dominates over radiation pressure, and the adiabatic index is nearly that of nonrelativistic gas, Γ ¯ 1 5 / 3 Γ ¯ 1 5 / 3 bar(Gamma)_(1)~~5//3\bar{\Gamma}_{1} \approx 5 / 3Γ¯15/3. Such stars vibrate stably. The net effect of the oscillating opacity and burning rate is usually to extract energy from the vibrations. Thus, they damp. (The vibrations of Cepheid variable stars are a notable exception.)
No one has yet seen a main-sequence star with mass above about 60 M 60 M 60M_(o.)60 M_{\odot}60M. This is explained as follows. For masses above 60 M 60 M 60M_(o.)60 M_{\odot}60M, the temperature should be so high that radiation pressure dominates over gas pressure, and the adiabatic index Γ ¯ 1 Γ ¯ 1 bar(Gamma)_(1)\bar{\Gamma}_{1}Γ¯1 is only slightly above the value 4 / 3 4 / 3 4//34 / 34/3 for pure radiation. Consequently the "spring constant" of the star, although positive, is very small. On the inward stroke of an oscillation, the central temperature rises, and nuclear burning speeds up. (The nuclear burning rate goes as a very high power of the central temperature; for example, in a massive star HCNO burning releases energy at a rate ε HCNO T c 11 ε HCNO T c 11 epsi_(HCNO)propT_(c)^(11)\varepsilon_{\mathrm{HCNO}} \propto T_{c}{ }^{11}εHCNOTc11.) Because the spring constant is so small, the inward stroke lasts for a long time, and the enhanced nuclear burning produces a significant excess of thermal energy and pressure. Hence, on the outward stroke the star expands more vigorously than it contracted ("engine"). Successive vibrations are driven to higher and higher amplitudes. Eventually, calculations suggest, the star either explodes, or it ejects enough mass by its vigorous vibrations to drop below the critical limit of M 60 M M 60 M M∼60M_(o.)M \sim 60 M_{\odot}M60M. Hence, stars of mass above 60 M 60 M 60M_(o.)60 M_{\odot}60M should not live long enough that astronomers could have a reasonable probability of discovering them.
Of course, this "engine action" does not prevent massive stars from forming, living a short time, and then disrupting themselves. Such a possibility is particularly intriguing for supermassive stars [ M M MMM between 10 3 M 10 3 M 10^(3)M_(o.)10^{3} M_{\odot}103M and 10 9 M 0.01 × 10 9 M 0.01 × 10^(9)M_(o.)∼0.01 xx10^{9} M_{\odot} \sim 0.01 \times109M0.01× (mass of a galaxy)]. Although such stars may be exceedingly rare, by their huge masses and huge release of explosive energy they might play an important role in the universe. Moreover, it is conceivable that the oscillations of such stars, like those of Cepheid variables, might be sustained at large amplitudes for long times (a million years?), with nonlinear damping processes preventing their further growth.
Theory predicts that general relativistic effects should strongly influence the oscillations of a supermassive star. The increase in "gravitational force," δ F grav δ F grav  deltaF_("grav ")\delta F_{\text {grav }}δFgrav , acting on a shell of matter on the inward stroke is greater in general relativity than in Newtonian theory, and the decrease on the outward stroke is also greater. Consequently the "effective index" Γ 1 crit Γ 1  crit  Gamma_(1" crit ")\Gamma_{1 \text { crit }}Γ1 crit  of gravitational forces is increased above the Newtonian value of 4 / 3 4 / 3 4//34 / 34/3; thus,
( fractional increase in "pressure-like force of gravity" per unit fractional change in baryon-number density ) Γ 1 crit = ( 4 / 3 ) + α ( M / R ) + O ( M 2 / R 2 )  fractional increase in   "pressure-like force of   gravity" per unit fractional   change in baryon-number   density  Γ 1  crit  = ( 4 / 3 ) + α ( M / R ) + O M 2 / R 2 ([" fractional increase in "],[" "pressure-like force of "],[" gravity" per unit fractional "],[" change in baryon-number "],[" density "])-=Gamma_(1" crit ")=(4//3)+alpha(M//R)+O(M^(2)//R^(2))\left(\begin{array}{l}\text { fractional increase in } \\ \text { "pressure-like force of } \\ \text { gravity" per unit fractional } \\ \text { change in baryon-number } \\ \text { density }\end{array}\right) \equiv \Gamma_{1 \text { crit }}=(4 / 3)+\alpha(M / R)+O\left(M^{2} / R^{2}\right)( fractional increase in  "pressure-like force of  gravity" per unit fractional  change in baryon-number  density )Γ1 crit =(4/3)+α(M/R)+O(M2/R2),
where α α alpha\alphaα is a constant of the order of unity that depends on the structure of the star (see Box 26.2). To resist gravity, one has only the elasticity of the relativistic material of the star:
(24.7) ( fractional increase in "pressure-like resisting force" per unit fractional change in baryon number density ) = Γ ¯ 1 = p n ( p n ) s effective average over star (24.7)  fractional increase in   "pressure-like resisting   force" per unit fractional   change in baryon number   density  = Γ ¯ 1 = p n p n s  effective average   over star  {:(24.7)([" fractional increase in "],[" "pressure-like resisting "],[" force" per unit fractional "],[" change in baryon number "],[" density "])= bar(Gamma)_(1)=(:(p)/(n)((del p)/(del n))_(s):)_({:[" effective average "],[" over star "]:}):}\left(\begin{array}{l} \text { fractional increase in } \tag{24.7}\\ \text { "pressure-like resisting } \\ \text { force" per unit fractional } \\ \text { change in baryon number } \\ \text { density } \end{array}\right)=\bar{\Gamma}_{1}=\left\langle\frac{p}{n}\left(\frac{\partial p}{\partial n}\right)_{s}\right\rangle_{\substack{\text { effective average } \\ \text { over star }}}(24.7)( fractional increase in  "pressure-like resisting  force" per unit fractional  change in baryon number  density )=Γ¯1=pn(pn)s effective average  over star 
The effective spring constant for the vibrations of the star is governed by the delicate margin between these two indices:
k = ( effective spring constant ) = ( contribution of "elastic forces" ) ( contribution of gravity ) (24.8) = 3 M ( Γ ¯ 1 Γ 1 crit ) | Ω | I k = (  effective   spring constant  ) =  contribution   of "elastic   forces"  (  contribution   of gravity  ) (24.8) = 3 M Γ ¯ 1 Γ 1  crit  | Ω | I {:[k=((" effective ")/(" spring constant "))=([" contribution "],[" of "elastic "],[" forces" "])-((" contribution ")/(" of gravity "))],[(24.8)=3M( bar(Gamma)_(1)-Gamma_(1" crit "))(|Omega|)/(I)]:}\begin{align*} k=\binom{\text { effective }}{\text { spring constant }} & =\left(\begin{array}{l} \text { contribution } \\ \text { of "elastic } \\ \text { forces" } \end{array}\right)-\binom{\text { contribution }}{\text { of gravity }} \\ & =3 M\left(\bar{\Gamma}_{1}-\Gamma_{1 \text { crit }}\right) \frac{|\Omega|}{I} \tag{24.8} \end{align*}k=( effective  spring constant )=( contribution  of "elastic  forces" )( contribution  of gravity )(24.8)=3M(Γ¯1Γ1 crit )|Ω|I
(derivation in Chapter 26). The relativistic rise in the effective index of gravity above 4 / 3 4 / 3 4//34 / 34/3 [equation (24.6)] brings on the transition from stability (positive k k kkk; vibration) to instability (negative k k kkk; explosion or collapse) under conditions when one otherwise would have expected stability. For supermassive stars, Fowler and Hoyle (1964) show that
Γ ¯ 1 = 4 / 3 + ζ ( M / M ) 1 / 2 Γ ¯ 1 = 4 / 3 + ζ M / M 1 / 2 bar(Gamma)_(1)=4//3+zeta(M//M_(o.))^(-1//2)\bar{\Gamma}_{1}=4 / 3+\zeta\left(M / M_{\odot}\right)^{-1 / 2}Γ¯1=4/3+ζ(M/M)1/2
where ζ ζ zeta\zetaζ is a constant of order unity. As a newly formed supermassive star contracts inward, heating up, but not yet hot enough to ignite its nuclear fuel, it approaches nearer and nearer to instability against collapse. Unless burning halts the contraction, collapse sets in at a radius R crit R crit  R_("crit ")R_{\text {crit }}Rcrit  given by
Γ ¯ 1 = 4 / 3 + ζ ( M / M ) 1 / 2 = Γ 1 crit = 4 / 3 + α M / R ; Γ ¯ 1 = 4 / 3 + ζ M / M 1 / 2 = Γ 1  crit  = 4 / 3 + α M / R ; bar(Gamma)_(1)=4//3+zeta(M//M_(o.))^(-1//2)=Gamma_(1" crit ")=4//3+alpha M//R;\bar{\Gamma}_{1}=4 / 3+\zeta\left(M / M_{\odot}\right)^{-1 / 2}=\Gamma_{1 \text { crit }}=4 / 3+\alpha M / R ;Γ¯1=4/3+ζ(M/M)1/2=Γ1 crit =4/3+αM/R;
i.e.,
R = ( α / 2 ζ ) ( M / M ) 1 / 2 × ( Schwarzschild Radius ) 10 4 × ( Schwarzschild Radius ) if M = 10 8 M R = ( α / 2 ζ ) M / M 1 / 2 × (  Schwarzschild Radius  ) 10 4 × (  Schwarzschild Radius  )  if  M = 10 8 M {:[R=(alpha//2zeta)(M//M_(o.))^(1//2)xx(" Schwarzschild Radius ")],[∼10^(4)xx(" Schwarzschild Radius ")" if "M=10^(8)M_(o.)]:}\begin{aligned} R & =(\alpha / 2 \zeta)\left(M / M_{\odot}\right)^{1 / 2} \times(\text { Schwarzschild Radius }) \\ & \sim 10^{4} \times(\text { Schwarzschild Radius }) \text { if } M=10^{8} M_{\odot} \end{aligned}R=(α/2ζ)(M/M)1/2×( Schwarzschild Radius )104×( Schwarzschild Radius ) if M=108M
The relativistic instability occurs far outside the Schwarzschild radius when the star is very massive. Relativity hardly modifies the star's structure at all; but because of the delicate balance between δ F ¯ grav δ F ¯ grav  delta bar(F)_("grav ")\delta \bar{F}_{\text {grav }}δF¯grav  and δ F ¯ buoy δ F ¯ buoy  delta bar(F)_("buoy ")\delta \bar{F}_{\text {buoy }}δF¯buoy  in the Newtonian oscillations (Box 24.2), tiny relativistic corrections to these forces can completely change the stability.
In practice, the story of a supermassive star is far more complicated than has been indicated here. Rotation can stabilize it against relativistic collapse for a while. However, after the star has lost all angular momentum in excess of the critical value
Temporary stabilization by rotation
Possible scenarios for evolution and death of a supermassive star
J crit = M 2 J crit  = M 2 J_("crit ")=M^(2)J_{\text {crit }}=M^{2}Jcrit =M2 ("extreme Kerr limit"; see Chapter 33), and after it has contracted to near the Schwarzschild radius, rotation is helpless to stave off implosion. Depending on its mass and angular momentum, the star may ignite its fuel before or after relativistic collapse begins, and before or after implosion through the Schwarzschild radius. When the fuel is ignited, it can wreak havoc, because even if the star is not then imploding, its adiabatic index will be very near the critical one, and the burning may drive oscillations to higher and higher amplitudes. These processes are so complex that in 1973 one is far from having satisfactory analyses of them, but for reviews of what is known and has been done, the reader can consult Fowler (1966), Thorne (1967), and Zel'dovich and Novikov (1971).
The theory of stellar pulsations in general relativity is presented for Track-2 readers in Chapter 26 of this book.

§24.5. QUASARS AND EXPLOSIONS IN GALACTIC NUCLEI

Supermassive stars were first conceived by Hoyle and Fowler (1963a,b) as an explanation for explosions in the nuclei of galaxies. Shortly thereafter, when quasars were discovered, Hoyle and Fowler quite naturally appealed to their supermassive stars for an explanation of these puzzles as well. Whether galactic explosions or quasars are driven by supermassive stars remains a subject of debate in astronomical circles even as this book is being finished, in 1973. Hence, this book will avoid the issue except for the following remark.
Whatever is responsible for quasars and galactic explosions must be a machine of great mass ( M 10 6 M 10 6 M∼10^(6)M \sim 10^{6}M106 to 10 10 M 10 10 M 10^(10)M_(o.)10^{10} M_{\odot}1010M ) and small radius (light-travel time across the machine, as deduced from light variations, is sometimes less than a day). The machine might be a coherent object, i.e., a supermassive star; or it might be a dense mixture of ordinary stars and much gas. Actually these two possibilities may not be distinct. Star-star collisions in a dense cluster can lead to stellar coalescence and the gradual building up of one or more supermassive stars [Sanders (1970); Spitzer (1971); Colgate (1967)]. Thus, at one stage in its life, a galactic nucleus or quasar might be driven by collisions in a dense star cluster; and at a later stage it might be driven by a supermassive star; and at a still later stage that star might collapse to leave behind a massive black hole ( 10 6 10 9 M ) 10 6 10 9 M (10^(6)-10^(9)M_(o.))\left(10^{6}-10^{9} M_{\odot}\right)(106109M), but a black hole that is still "live" and active (Chapter 33).

§24.6. RELATIVISTIC STAR CLUSTERS

The normal astrophysical evolution of a galactic nucleus is estimated [Sanders (1970); Spitzer (1971)] to lead under some circumstances to a star cluster so dense that general relativity influences its structure and evolution. The theory of relativistic star clusters is closely related to that of relativistic stars, as developed in Chapter 23. A star is a swarm of gas molecules that collide frequently; a star cluster is a swarm of stars that collide rarely. But the frequency of collisions is relatively unim-
portant in a steady state. For the theory of relativistic star clusters, see: § 25.7 § 25.7 §25.7\S 25.7§25.7 of this book; Zel'dovich and Podurets (1965); Fackerell, Ipser, and Thorne (1969); Chapter 12 of Zel'dovich and Novikov (1971); and references cited there. A relativistic star cluster is a latent volcano. No future is evident for it except to evolve with enormous energy release to a massive black hole, either by direct collapse (possibly a star at a time) or by first coalescing into a supermassive star that later collapses.

снартев 25

THE "'PIT IN THE POTENTIAL" AS THE CENTRAL NEW FEATURE OF MOTION IN SCHWARZSCHILD GEOMETRY

"Eccentric, intervolved, yet regularThen most, when most irregular they seem; And in their motions harmony divine"

This chapter is entirely Track 2, except for Figures 25.2 and 25.6, and Boxes 25.6 and 25.7 (pp. 639, 660, 674, and 677), which Track-1 readers should peruse for insight and flavor. No earlier Track-2 material is needed as preparation for it.
§25.2 (symmetries) is needed as preparation for Box 30.2 (Mixmaster cosmology). The rest of the chapter is not essential for any later chapter, but it will be helpful in understanding
(1) Chapters 31-34
(gravitational collapse and black holes), and
(2) Chapter 40 (solar-system experiments).

§25.1. FROM KEPLER'S LAWS TO THE EFFECTIVE POTENTIAL FOR MOTION IN SCHWARZSCHILD GEOMETRY

No greater glory crowns Newton's theory of gravitation than the account it gives of the principal features of the solar system: a planet in its motion sweeps out equal areas in equal times; its orbit is an ellipse, with one focus at the sun; and the cube of the semimajor axis, a a aaa, of the ellipse, multiplied by the square of the average angular velocity of the planet in its orbit ( ω = 2 π / ω = 2 π / omega=2pi//\omega=2 \pi /ω=2π/ period) gives a number with the dimensions of a length, the same number for all the planets (Box 25.1), equal to the mass of the sun:
M = ω 2 a 3 M = ω 2 a 3 M=omega^(2)a^(3)M=\omega^{2} a^{3}M=ω2a3
Exactly the same is true for the satellites of Jupiter (Figure 25.1), and of the Earth (Box 25.1 ), and true throughout the heavens. What more can one possibly expect of Einstein's theory of gravity when it in its turn grapples with this centuries-old theme of a test object moving under the influence of a spherically symmetric center of attraction? The principal new result can be stated in a single sentence: The particle is governed by an "effective potential" (Figure 25.2 and § § 25.5 , 25.6 § § 25.5 , 25.6 §§25.5,25.6\S \S 25.5,25.6§§25.5,25.6 ) that possesses not only (1) the long distance M / r M / r -M//r-M / rM/r attractive behavior and (2) the shorter distance
(angular momentum) ) 2 / r 2 ) 2 / r 2 )^(2)//r^(2))^{2} / r^{2})2/r2 repulsive behavior of Newtonian gravitational theory, but also (3) at still shorter distances a pit in the potential, which (1) captures a particle that comes too close; (2) establishes a critical distance of closest approach for this black-hole capture process; (3) for a particle that approaches this critical point without crossing it, lengthens the turn-around time as compared to Newtonian expectations; and thereby (4) makes the period for a radial excursion longer than the period of a revolution; (5) causes an otherwise Keplerian orbit to precess; and (6) deflects a fast particle and a photon through larger angles than Newtonian theory would predict.
The pit in the potential being thus the central new feature of motion in Schwarzschild geometry and the source of major predictions (Box 25.2), it is appropriate to look for the most direct road into the concept of effective potential and its meaning
and application. In this search no guide is closer to hand than Newtonian mechanics.
Analytic mechanics offers several ways to deal with the problem of motion in a central field of force, and among them are two of central relevance here: (1) the world-line method, which includes second-order differential equations of motion, Lagrange's equations, search for constants of integration, reduction to first-order equations, and further integration in rather different ways according as one wants the shape of the orbit, θ = θ ( r ) θ = θ ( r ) theta=theta(r)\theta=\theta(r)θ=θ(r), or the time to get to a given point on the world line, t = t ( r ) t = t ( r ) t=t(r)t=t(r)t=t(r); and (2) the wave-crest method, otherwise known as the "eikonal method" or "Hamilton-Jacobi method," which gives the motion by the condition of "constructive interference of wave crests," thus making a single leap from the Hamilton-Jacobi equation to the motion of the test object. Both methods are em-
Figure 25.1.
Jupiter's satellites, as followed from night to night with field glasses or telescope, provide an opportunity to check for oneself the central ideas of gravitation physics in the Newtonian approximation (distances large compared to Schwarzschild radius). For the practically circular orbits of these satellites, Kepler's law becomes M 1 = ω 2 r 3 M 1 = ω 2 r 3 M^(1)=omega^(2)r^(3)M^{1}=\omega^{2} r^{3}M1=ω2r3 ("1-2-3 principle") and the velocity in orbit is β = ω r β = ω r beta=omega r\beta=\omega rβ=ωr. Out of observations on any two of the quantities β , M , ω , r β , M , ω , r beta,M,omega,r\beta, M, \omega, rβ,M,ω,r, one can find the other two. (In the opposite limiting case of two objects, each of mass M M MMM, going around their common center of gravity with separation r r rrr, one has M = ω 2 r 3 / 2 , β = ω r / 2 M = ω 2 r 3 / 2 , β = ω r / 2 M=omega^(2)r^(3)//2,beta=omega r//2M=\omega^{2} r^{3} / 2, \beta=\omega r / 2M=ω2r3/2,β=ωr/2 ). The configurations of satellites I-IV of Jupiter as given here for December 1964 (days 0.0 , 1.0 , 2.0 0.0 , 1.0 , 2.0 0.0,1.0,2.00.0,1.0,2.00.0,1.0,2.0, etc. in "universal time," for which see any good dictionary or encyclopedia) are taken from The American Ephemeris and Nautical Almanac for 1964 [U.S. Government Printing Office (1962)].

Box 25.1 MASS FROM MEAN ANGULAR FREQUENCY AND SEMIMAJOR AXIS: M = ω 2 a 3 M = ω 2 a 3 M=omega^(2)a^(3)M=\omega^{2} a^{3}M=ω2a3

Appropriateness of Newtonian analysis shown by smallness of mass (or "halfSchwarzschild radius" or "extension of the pit in the potential") as listed in last column compared to the semimajor axis a a aaa in the next-to-last column. Basic data from compilation of Allen (1963).
Object Period a ^("a "){ }^{\text {a }} (days) ω ( cm 1 ) ω cm 1 omega(cm^(-1))\omega\left(\mathrm{cm}^{-1}\right)ω(cm1) a ( cm ) a ( cm ) a(cm)a(\mathrm{~cm})a( cm) ω 2 a 3 ( cm ) ω 2 a 3 ( cm ) omega^(2)a^(3)(cm)\omega^{2} a^{3}(\mathrm{~cm})ω2a3( cm)
Planets
Mercury 87.9686 275.8 × 10 19 275.8 × 10 19 275.8 xx10^(-19)275.8 \times 10^{-19}275.8×1019 0.5791 × 10 13 0.5791 × 10 13 0.5791 xx10^(13)0.5791 \times 10^{13}0.5791×1013 1.477 × 10 5 1.477 × 10 5 1.477 xx10^(5)1.477 \times 10^{5}1.477×105
Venus 224.700 107.95 1.0821 1.477
Earth 365.257 66.41 1.4960 1.477
Mars 686.980 35.31 2.2794 1.477
Jupiter 4332.587 5.599 7.783 1.478
Saturn 10759.20 2.255 14.27 1.477
Uranus 30685 0.7905 28.69 1.476
Neptune 60188 0.4030 44.98 1.478
Pluto 90700 0.2674 × 10 19 0.2674 × 10 19 0.2674 xx10^(-19)0.2674 \times 10^{-19}0.2674×1019 59.00 × 10 13 59.00 × 10 13 59.00 xx10^(13)59.00 \times 10^{13}59.00×1013 1.469 × 10 5 1.469 × 10 5 1.469 xx10^(5)1.469 \times 10^{5}1.469×105
Major satellites of Jupiter
Io 1.769138 13.711 × 10 16 13.711 × 10 16 13.711 xx10^(-16)13.711 \times 10^{-16}13.711×1016 0.422 × 10 11 0.422 × 10 11 0.422 xx10^(11)0.422 \times 10^{11}0.422×1011 141.3
Europa 3.551181 6.831 0.671 141.0
Ganymede 7.154553 3.391 10 16 3.391 10 16 3.391∼10^(-16)3.391 \sim 10^{-16}3.3911016 1.070 140.8
Callisto 16.689018 1.454 × 10 16 1.454 × 10 16 1.454 xx10^(-16)1.454 \times 10^{-16}1.454×1016 1.883 × 10 11 1.883 × 10 11 1.883 xx10^(11)1.883 \times 10^{11}1.883×1011 141.1
Two satellites of Earth
OSO 5 b 5 5^("b ")5^{\text {b }}5 95.6 min . 3.65 × 10 14 3.65 × 10 14 3.65 xx10^(-14)3.65 \times 10^{-14}3.65×1014 6.916 × 10 8 6.916 × 10 8 6.916 xx10^(8)6.916 \times 10^{8}6.916×108 0.442
Moon 27.32 0.888 × 10 16 0.888 × 10 16 0.888 xx10^(-16)0.888 \times 10^{-16}0.888×1016 3.84 × 10 10 3.84 × 10 10 3.84 xx10^(10)3.84 \times 10^{10}3.84×1010 0.446
Object Period ^("a ") (days) omega(cm^(-1)) a(cm) omega^(2)a^(3)(cm) Planets Mercury 87.9686 275.8 xx10^(-19) 0.5791 xx10^(13) 1.477 xx10^(5) Venus 224.700 107.95 1.0821 1.477 Earth 365.257 66.41 1.4960 1.477 Mars 686.980 35.31 2.2794 1.477 Jupiter 4332.587 5.599 7.783 1.478 Saturn 10759.20 2.255 14.27 1.477 Uranus 30685 0.7905 28.69 1.476 Neptune 60188 0.4030 44.98 1.478 Pluto 90700 0.2674 xx10^(-19) 59.00 xx10^(13) 1.469 xx10^(5) Major satellites of Jupiter Io 1.769138 13.711 xx10^(-16) 0.422 xx10^(11) 141.3 Europa 3.551181 6.831 0.671 141.0 Ganymede 7.154553 3.391∼10^(-16) 1.070 140.8 Callisto 16.689018 1.454 xx10^(-16) 1.883 xx10^(11) 141.1 Two satellites of Earth OSO 5^("b ") 95.6 min . 3.65 xx10^(-14) 6.916 xx10^(8) 0.442 Moon 27.32 0.888 xx10^(-16) 3.84 xx10^(10) 0.446| Object | Period ${ }^{\text {a }}$ (days) | $\omega\left(\mathrm{cm}^{-1}\right)$ | $a(\mathrm{~cm})$ | $\omega^{2} a^{3}(\mathrm{~cm})$ | | :---: | :---: | :---: | :---: | :---: | | Planets | | | | | | Mercury | 87.9686 | $275.8 \times 10^{-19}$ | $0.5791 \times 10^{13}$ | $1.477 \times 10^{5}$ | | Venus | 224.700 | 107.95 | 1.0821 | 1.477 | | Earth | 365.257 | 66.41 | 1.4960 | 1.477 | | Mars | 686.980 | 35.31 | 2.2794 | 1.477 | | Jupiter | 4332.587 | 5.599 | 7.783 | 1.478 | | Saturn | 10759.20 | 2.255 | 14.27 | 1.477 | | Uranus | 30685 | 0.7905 | 28.69 | 1.476 | | Neptune | 60188 | 0.4030 | 44.98 | 1.478 | | Pluto | 90700 | $0.2674 \times 10^{-19}$ | $59.00 \times 10^{13}$ | $1.469 \times 10^{5}$ | | Major satellites of Jupiter | | | | | | Io | 1.769138 | $13.711 \times 10^{-16}$ | $0.422 \times 10^{11}$ | 141.3 | | Europa | 3.551181 | 6.831 | 0.671 | 141.0 | | Ganymede | 7.154553 | $3.391 \sim 10^{-16}$ | 1.070 | 140.8 | | Callisto | 16.689018 | $1.454 \times 10^{-16}$ | $1.883 \times 10^{11}$ | 141.1 | | Two satellites of Earth | | | | | | OSO $5^{\text {b }}$ | 95.6 min . | $3.65 \times 10^{-14}$ | $6.916 \times 10^{8}$ | 0.442 | | Moon | 27.32 | $0.888 \times 10^{-16}$ | $3.84 \times 10^{10}$ | 0.446 |
a ^("a "){ }^{\text {a }} Sidereal period: time to make one revolution relative to fixed stars.
b ^("b "){ }^{\text {b }} Orbiting scientific observatory launched Jan. 22, 1969, to observe x-rays and ultraviolet radiation from the sun. Perigee 531 km , apogee 560 km , above earth.
SOME TYPICAL MASSES AND TIMES IN CONVENTIONAL AND GEOMETRIC UNITS. Conversion factor for mass,
G / c 2 = 0.742 × 10 28 cm / g G / c 2 = 0.742 × 10 28 cm / g G//c^(2)=0.742 xx10^(-28)cm//gG / c^{2}=0.742 \times 10^{-28} \mathrm{~cm} / \mathrm{g}G/c2=0.742×1028 cm/g
Mass Galaxy Sun Jupiter Earth
M conv ( g ) M conv  ( g ) M_("conv ")(g)M_{\text {conv }}(\mathrm{g})Mconv (g) 2.2 × 10 44 2.2 × 10 44 2.2 xx10^(44)2.2 \times 10^{44}2.2×1044 1.989 × 10 33 1.989 × 10 33 1.989 xx10^(33)1.989 \times 10^{33}1.989×1033 1.899 × 10 30 1.899 × 10 30 1.899 xx10^(30)1.899 \times 10^{30}1.899×1030 5.977 × 10 27 5.977 × 10 27 5.977 xx10^(27)5.977 \times 10^{27}5.977×1027
M ( cm ) M ( cm ) M(cm)M(\mathrm{~cm})M( cm) 1.6 × 10 16 1.6 × 10 16 1.6 xx10^(16)1.6 \times 10^{16}1.6×1016 1.47 × 10 5 1.47 × 10 5 1.47 xx10^(5)1.47 \times 10^{5}1.47×105 112 0.444
Mass Galaxy Sun Jupiter Earth M_("conv ")(g) 2.2 xx10^(44) 1.989 xx10^(33) 1.899 xx10^(30) 5.977 xx10^(27) M(cm) 1.6 xx10^(16) 1.47 xx10^(5) 112 0.444| Mass | Galaxy | Sun | Jupiter | Earth | | :--- | :--- | :--- | :--- | :--- | | $M_{\text {conv }}(\mathrm{g})$ | $2.2 \times 10^{44}$ | $1.989 \times 10^{33}$ | $1.899 \times 10^{30}$ | $5.977 \times 10^{27}$ | | $M(\mathrm{~cm})$ | $1.6 \times 10^{16}$ | $1.47 \times 10^{5}$ | 112 | 0.444 |
Conversion factor for time, c = 2.998 × 10 10 cm / sec c = 2.998 × 10 10 cm / sec c=2.998 xx10^(10)cm//secc=2.998 \times 10^{10} \mathrm{~cm} / \mathrm{sec}c=2.998×1010 cm/sec. One sidereal year = 365.256 = 365.256 =365.256=365.256=365.256 days or 3.1558 × 10 7 3.1558 × 10 7 3.1558 xx10^(7)3.1558 \times 10^{7}3.1558×107 sec.
Period 1 sec 1 min 1 hr 1 day
ω conv ( sec 1 ) ω conv  sec 1 omega_("conv ")(sec^(-1))\omega_{\text {conv }}\left(\mathrm{sec}^{-1}\right)ωconv (sec1) 6.28 1.046 × 10 1 1.046 × 10 1 1.046 xx10^(-1)1.046 \times 10^{-1}1.046×101 1.75 × 10 3 1.75 × 10 3 1.75 xx10^(-3)1.75 \times 10^{-3}1.75×103 7.28 × 10 5 7.28 × 10 5 7.28 xx10^(-5)7.28 \times 10^{-5}7.28×105
ω ( cm 1 ) ω cm 1 omega(cm^(-1))\omega\left(\mathrm{~cm}^{-1}\right)ω( cm1) 2.09 × 10 10 2.09 × 10 10 2.09 xx10^(-10)2.09 \times 10^{-10}2.09×1010 3.48 × 10 12 3.48 × 10 12 3.48 xx10^(-12)3.48 \times 10^{-12}3.48×1012 5.80 × 10 14 5.80 × 10 14 5.80 xx10^(-14)5.80 \times 10^{-14}5.80×1014 2.42 × 10 15 2.42 × 10 15 2.42 xx10^(-15)2.42 \times 10^{-15}2.42×1015
1 week 1 month 1 year
1.04 × 10 5 1.04 × 10 5 1.04 xx10^(-5)1.04 \times 10^{-5}1.04×105 2.39 × 10 6 2.39 × 10 6 2.39 xx10^(-6)2.39 \times 10^{-6}2.39×106 1.99 × 10 7 1.99 × 10 7 1.99 xx10^(-7)1.99 \times 10^{-7}1.99×107
3.46 × 10 16 3.46 × 10 16 3.46 xx10^(-16)3.46 \times 10^{-16}3.46×1016 7.95 × 10 17 7.95 × 10 17 7.95 xx10^(-17)7.95 \times 10^{-17}7.95×1017 6.63 × 10 18 6.63 × 10 18 6.63 xx10^(-18)6.63 \times 10^{-18}6.63×1018
Period 1 sec 1 min 1 hr 1 day omega_("conv ")(sec^(-1)) 6.28 1.046 xx10^(-1) 1.75 xx10^(-3) 7.28 xx10^(-5) omega(cm^(-1)) 2.09 xx10^(-10) 3.48 xx10^(-12) 5.80 xx10^(-14) 2.42 xx10^(-15) 1 week 1 month 1 year 1.04 xx10^(-5) 2.39 xx10^(-6) 1.99 xx10^(-7) 3.46 xx10^(-16) 7.95 xx10^(-17) 6.63 xx10^(-18) | Period | 1 sec | 1 min | 1 hr | 1 day | | :--- | :---: | :---: | :---: | :---: | | $\omega_{\text {conv }}\left(\mathrm{sec}^{-1}\right)$ | 6.28 | $1.046 \times 10^{-1}$ | $1.75 \times 10^{-3}$ | $7.28 \times 10^{-5}$ | | $\omega\left(\mathrm{~cm}^{-1}\right)$ | $2.09 \times 10^{-10}$ | $3.48 \times 10^{-12}$ | $5.80 \times 10^{-14}$ | $2.42 \times 10^{-15}$ | | | 1 week | 1 month | 1 year | | | | $1.04 \times 10^{-5}$ | $2.39 \times 10^{-6}$ | $1.99 \times 10^{-7}$ | | | | $3.46 \times 10^{-16}$ | $7.95 \times 10^{-17}$ | $6.63 \times 10^{-18}$ | |
Figure 25.2
Effective potential for motion of a test particle in the Schwarzschild geometry of a concentrated mass M M MMM. Energy, in units of the rest mass μ μ mu\muμ of the particle, is denoted E ~ = E / μ E ~ = E / μ widetilde(E)=E//mu\widetilde{E}=E / \muE~=E/μ; angular momentum, L ~ = L / μ L ~ = L / μ widetilde(L)=L//mu\widetilde{L}=L / \muL~=L/μ. The quantity r r rrr denotes the Schwarzschild r r rrr coordinate. The effective potential (also in units of μ μ mu\muμ ) is defined by equation (25.16) or, equivalently, by the equation
( d r d τ ) 2 + V ~ 2 ( r ) = E ~ 2 d r d τ 2 + V ~ 2 ( r ) = E ~ 2 ((dr)/(d tau))^(2)+ widetilde(V)^(2)(r)= widetilde(E)^(2)\left(\frac{d r}{d \tau}\right)^{2}+\widetilde{V}^{2}(r)=\widetilde{E}^{2}(drdτ)2+V~2(r)=E~2
(see also $ 25.5 $ 25.5 $25.5\$ 25.5$25.5 ) and has the value
V ¯ = [ ( 1 2 M / r ) ( 1 + L ¯ 2 / r 2 ) ] 1 / 2 . V ¯ = ( 1 2 M / r ) 1 + L ¯ 2 / r 2 1 / 2 . bar(V)=[(1-2M//r)(1+ bar(L)^(2)//r^(2))]^(1//2).\bar{V}=\left[(1-2 M / r)\left(1+\bar{L}^{2} / r^{2}\right)\right]^{1 / 2} .V¯=[(12M/r)(1+L¯2/r2)]1/2.
It represents that value of E ~ E ~ widetilde(E)\widetilde{E}E~ at which the radial kinetic energy of the particle, at r r rrr, reduces to zero ( E ~ E ~ widetilde(E)\widetilde{E}E~-value that makes r r rrr into a "turning point": V ¯ ( r ) = E ¯ V ¯ ( r ) = E ¯ bar(V)(r)= bar(E)\bar{V}(r)=\bar{E}V¯(r)=E¯. Note that one could equally well regard V ¯ 2 ( r ) V ¯ 2 ( r ) bar(V)^(2)(r)\bar{V}^{2}(r)V¯2(r) as the effective potential, and define a turning point by the condition V ~ 2 = E ~ 2 V ~ 2 = E ~ 2 widetilde(V)^(2)= widetilde(E)^(2)\widetilde{V}^{2}=\widetilde{E}^{2}V~2=E~2. Which definition one chooses depends on convenience, on the intended application, on the tie to the archetypal differential equation 1 2 x ˙ 2 + V ( x ) = E 1 2 x ˙ 2 + V ( x ) = E (1)/(2)x^(˙)^(2)+V(x)=E\frac{1}{2} \dot{x}^{2}+V(x)=E12x˙2+V(x)=E, and on the stress one wishes to put on correspondence with the effective potential of Newtonian theory). Stable circular orbits are possible (representative point sitting at minimum of effective potential) only for L ~ L ~ tilde(L)\tilde{L}L~ values in excess of 2 3 M 2 3 M 2sqrt3M2 \sqrt{3} M23M. For any such fixed L L L\mathcal{L}L value, the motion departs slightly from circularity as the energy is raised above the potential minimum (see the two heavy horizontal lines for L ~ = 3.75 M L ~ = 3.75 M widetilde(L)=3.75M\widetilde{L}=3.75 \mathrm{M}L~=3.75M ). In classical physics, the motion is limited to the region of positive kinetic encrgy. In quantum physics, the particle can tunnel through the region where the kinetic energy, as calculated classically, is negative (dashed prolongations of heavy horizontal lines) and head for the "pit in the potential" (capture by black hole). Such tunneling is absolutely negligible when the center of attraction has any macroscopic dimension, but in principle becomes important for a black hole of mass 10 17 g 10 17 g 10^(17)g10^{17} \mathrm{~g}1017 g (or 10 11 cm 10 11 cm 10^(-11)cm10^{-11} \mathrm{~cm}1011 cm ) if such an object can in principle exist.
The diagram at the right gives values of the minimum and maximum of the potential as they depend on the angular momentum of the test particle. The roots of V ~ / r V ~ / r del widetilde(V)//del r\partial \widetilde{V} / \partial rV~/r are given in terms of the "reduced angular momentum parameter" L = L ~ / M = L / M μ L = L ~ / M = L / M μ L^(†)= widetilde(L)//M=L//M muL^{\dagger}=\widetilde{L} / M=L / M \muL=L~/M=L/Mμ by
r = 6 M 1 + ( 1 12 / L 2 ) 1 / 2 , E ~ 2 = ( L + 2 + 36 ) + ( L 2 12 ) ( 1 12 / L 2 ) 1 / 2 54 [ = 8 / 9 for L = ( 12 ) 1 / 2 ; 1 for L = 4 ; ( L 2 / 27 ) + ( 1 / 3 ) + ( 1 / L + 2 ) + for L ] r = 6 M 1 + 1 12 / L 2 1 / 2 , E ~ 2 = L + 2 + 36 + L 2 12 1 12 / L 2 1 / 2 54 = 8 / 9  for  L = ( 12 ) 1 / 2 ; 1  for  L = 4 ; L 2 / 27 + ( 1 / 3 ) + 1 / L + 2 +  for  L {:[r=(6M)/(1+(1-12//L^(†2))^(1//2))","],[ widetilde(E)^(2)=((L^(+2)+36)+(L^(†2)-12)(1-12//L^(†2))^(1//2))/(54)],[[=8//9" for "L^(†)=(12)^(1//2);1" for "L^(†)=4;(L^(†2)//27)+(1//3)+(1//L^(+2)):}],[{:+cdots" for "L^(†)longrightarrow oo]]:}\begin{gathered} r=\frac{6 M}{1+\left(1-12 / L^{\dagger 2}\right)^{1 / 2}}, \\ \widetilde{E}^{2}=\frac{\left(L^{+2}+36\right)+\left(L^{\dagger 2}-12\right)\left(1-12 / L^{\dagger 2}\right)^{1 / 2}}{54} \\ {\left[=8 / 9 \text { for } L^{\dagger}=(12)^{1 / 2} ; 1 \text { for } L^{\dagger}=4 ;\left(L^{\dagger 2} / 27\right)+(1 / 3)+\left(1 / L^{+2}\right)\right.} \\ \left.+\cdots \text { for } L^{\dagger} \longrightarrow \infty\right] \end{gathered}r=6M1+(112/L2)1/2,E~2=(L+2+36)+(L212)(112/L2)1/254[=8/9 for L=(12)1/2;1 for L=4;(L2/27)+(1/3)+(1/L+2)+ for L]
(plus root for maximum of the effective potential; minus root for minimum; see exercise 25.18 ).

Box 25.2 MOTION IN SCHWARZSCHILD GEOMETRY REGARDED AS A CENTRAL POINT OF DEPARTURE FOR MAJOR APPLICATIONS OF EINSTEIN'S GEOMETRODYNAMICS
  1. Newtonian effect of sun on planets and of earth on moon and man.
  2. Bending of light by sun.
  3. Red shift of light from sun.
  4. Precession of the perihelion of Mercury around the sun.
  5. Capture of a test object by a black hole as simple exemplar of gravitational collapse.
  6. Dynamics of Friedmann universe derived from model of Schwarzschild "lattice universe." Lattice universe is constructed from 120 or some other magic number of concentrations of mass, each mass in an otherwise empty lattice cell of its own. Each lattice cell, though actually polygonal, is idealized (see Wigner-Seitz approximation of solid-state physics) as spherical. A test object at the interface between two cells falls toward the center of each [standard radial motion in Schwarzschild geometry; see discussion following equation (25.27). Therefore the two masses fall toward each other at a calculable rate. From this simple argument follows the entire dynamics of the closed 3 -sphere lattice universe, in close concord with the predictions of the Friedmann model [see Lindquist and Wheeler (1957)].
  7. Perturbations of Schwarzschild geometry, I. Gravitational waves are incident on, scattered by, and captured into a black hole. Waves with wavelength short compared to the Schwarzschild radius can be analyzed to good approximation by the methods of geometric optics (exercises 35.15 and 35.16), as employed in this chapter to treat the motions of particles and photons. For longer wavelengths, there occur important physical-optics corrections to this
    geometric-optics idealization (see § 35.8 § 35.8 §35.8\S 35.8§35.8 and exercises 32.10, 32.11). Similar considerations apply to electromagnetic and de Broglie waves.
  8. Lepton number for an electron in its lowest quantum state in the geometry ("gravitational field of force") of a black hole is calculated to be transcended (capture of the electron!) or not according as the mass of this black hole is large or small compared to a certain critical mass M e = M 2 / m e ( 10 17 g M e = M 2 / m e 10 17 g M_(**e)=M^(**2)//m_(e)(∼10^(17)(g):}M_{* e}=M^{* 2} / m_{e}\left(\sim 10^{17} \mathrm{~g}\right.Me=M2/me(1017 g or 10 11 cm ) 10 11 cm {:10^(-11)(cm))\left.10^{-11} \mathrm{~cm}\right)1011 cm) [Hartle (1971, 1972); Wheeler (1971b,c); Teitelboim (1972b,c)]. Similarly (with another value for the critical mass) for conservation of baryon number [Bekenstein (1972a,b), Teitelboim (1972a)]. To analyze "transcendence or not" one must solve quantum-mechanical wave equations, of which the Hamilton-Jacobi equation for particle and photon orbits is a classical limit. These quantum wave equations contain effective potentials identical-aside from spin-dependent and wavelength-dependent corrections-to the effective potentials for particle and photon motion.
  9. Perturbations of Schwarzschild geometry, II. Those small changes in standard Schwarzschild black-hole geometry which remain stationary in time describe the alterations in a "dead" black hole that make it into a "live" black hole, one endowed with angular momentum as well as mass (see Chapter 33). To analyze such changes in black-hole geometry, one must again solve wave equations, but wave equations which are now classical. Once more the wave equations are closely related to the Hamilton-Jacobi equation, and their effective potentials are close kin to those for particle motion.
    ployed here in turn because each gives special insights. The Hamilton-Jacobi method (Box 25.3) leads quickly to the major results of interest (Box 25.4), and it has a close tie to the quantum principle. The world-line method ( $ $ 25.2 , 25.3 , 25.4 $ $ 25.2 , 25.3 , 25.4 $$25.2,25.3,25.4\$ \$ 25.2,25.3,25.4$$25.2,25.3,25.4 ) starts with the geodesic equations of motion themselves. It provides a more familiar way into the subject for a reader not acquainted with the Hamilton-Jacobi approach. Moreover, in attempting to solve the geodesic equations of motion, one must analyze symmetry properties of the geometry, an enterprise that continues to pay dividends when one moves from Schwarzschild geometry to Kerr-Newman geometry (Chapter 33), and from Friedmann cosmology (Chapter 27) to more general cosmologies (Chapter 30).
    (continued on page 650)

Box 25.3 THE HAMILTON-JACOBI DESCRIPTION OF MOTION: NATURAL BECAUSE RATIFIED BY THE QUANTUM PRINCIPLE

  1. Purely classical (nonquantum).
  2. Originated with William Rowan Hamilton out of conviction that mechanics is similar in its character to optics; that the "particle world line" of mechanics is an idealization analogous to the "light ray" of geometric optics. Localization of energy of light ray is approximate only. Its spread is governed by wavelength of light ("geometric optics"). Hamilton had glimmerings of same idea for particles: "quantum physics before quantum physics." The way that he and Jacobi developed to analyze motion through the Hamilton-Jacobi function S ( x , t ) S ( x , t ) S(x,t)S(x, t)S(x,t)-to take the example of a dynamic system with only one degree of freedom, x x xxx makes the leap from classical ideas to quantum ideas as short as one knows how to make it. Moreover, the real world is a quantum world. Classical mechanics is not born out of a vacuum. It is an idealization of and approximation to quantum mechanics.
  3. Key idea is idealization to a particle wavelength so short that quantum-mechanical spread or uncertainty in location of particle (or spread of configuration coordinates of more complex system) is negligible. No better way was ever discovered to unite the spirit of quantum mechanics and the precision of location of classical mechanics.
  4. Call Hamiltonian H ( p , x ) = p 2 / 2 m + V ( x ) H ( p , x ) = p 2 / 2 m + V ( x ) H(p,x)=p^(2)//2m+V(x)H(p, x)=p^{2} / 2 m+V(x)H(p,x)=p2/2m+V(x). Call energy of particle E E EEE. Then there is no way whatever consistent with the quantum principle to describe the motion of the particle in space and time. The uncertainty principle forbids (sharply defined energy Δ E 0 Δ E 0 Delta E longrightarrow0\Delta E \longrightarrow 0ΔE0, in Δ E Δ t / 2 Δ E Δ t / 2 Delta E Delta t >= ℏ//2\Delta E \Delta t \geq \hbar / 2ΔEΔt/2, implies uncertainty Δ t Δ t Delta t longrightarrow oo\Delta t \longrightarrow \inftyΔt; also Δ p 0 Δ p 0 Delta p longrightarrow0\Delta p \longrightarrow 0Δp0 in Δ p Δ x / 2 Δ p Δ x / 2 Delta p Delta x >= ℏ//2\Delta p \Delta x \geq \hbar / 2ΔpΔx/2 implies Δ x ) Δ x ) Delta x longrightarrow oo)\Delta x \longrightarrow \infty)Δx). The quantum-mechanical wave function is spread out over all space. This spread shows in the so-called semi-

Box 25.3 (continued)

classical or Wentzel-Kramers-Brillouin ["WKB"; see, for example, Kemble (1937)] approximation for the probability amplitude function,
(1) ψ E ( x , t ) = ( slowly varying amplitude function ) e ( i / h ) S E ( x , t ) . (1) ψ E ( x , t ) = (  slowly varying   amplitude function  ) e ( i / h ) S E ( x , t ) . {:(1)psi_(E)(x","t)=((" slowly varying ")/(" amplitude function "))e^((i//h)S_(E)(x,t)).:}\begin{equation*} \psi_{E}(x, t)=\binom{\text { slowly varying }}{\text { amplitude function }} e^{(i / h) S_{E}(x, t)} . \tag{1} \end{equation*}(1)ψE(x,t)=( slowly varying  amplitude function )e(i/h)SE(x,t).

5. It is of no help in localizing the probability distribution that = 1.054 × = 1.054 × ℏ=1.054 xx\hbar=1.054 \times=1.054× 10 27 g cm 2 / s 10 27 g cm 2 / s 10^(-27)gcm^(2)//s10^{-27} \mathrm{~g} \mathrm{~cm}^{2} / \mathrm{s}1027 g cm2/s [or = ( 1.6 × 10 33 cm ) 2 = 1.6 × 10 33 cm 2 ℏ=(1.6 xx10^(-33)(cm))^(2)\hbar=\left(1.6 \times 10^{-33} \mathrm{~cm}\right)^{2}=(1.6×1033 cm)2 in geometric units] is very small compared to the "quantities of action" or "magnitudes of the Hamilton-Jacobi function, S S SSS " or "dynamic phase, S S SSS " encountered in most everyday applications.
6. It is of no help in localizing the probability distribution that this dynamic phase obeys the simple Hamilton-Jacobi law of propagation,
(2) S t = H ( S x , x ) = 1 2 m ( S x ) 2 + V ( x ) . (2) S t = H S x , x = 1 2 m S x 2 + V ( x ) . {:(2)-(del S)/(del t)=H((del S)/(del x),x)=(1)/(2m)((del S)/(del x))^(2)+V(x).:}\begin{equation*} -\frac{\partial S}{\partial t}=H\left(\frac{\partial S}{\partial x}, x\right)=\frac{1}{2 m}\left(\frac{\partial S}{\partial x}\right)^{2}+V(x) . \tag{2} \end{equation*}(2)St=H(Sx,x)=12m(Sx)2+V(x).
  1. It is of no help in localizing the probability distribution that the solution of this equation for a particle of energy E E EEE is extraordinarily simple,
(3) S ( x , t ) = E t + x 0 x { 2 m [ E V ( x ) ] } 1 / 2 d x + δ E (3) S ( x , t ) = E t + x 0 x { 2 m [ E V ( x ) ] } 1 / 2 d x + δ E {:(3)S(x","t)=-Et+int_(x_(0))^(x){2m[E-V(x)]}^(1//2)dx+delta_(E):}\begin{equation*} S(x, t)=-E t+\int_{x_{0}}^{x}\{2 m[E-V(x)]\}^{1 / 2} d x+\delta_{E} \tag{3} \end{equation*}(3)S(x,t)=Et+x0x{2m[EV(x)]}1/2dx+δE
(with δ E δ E delta_(E)\delta_{E}δE an arbitrary additive phase constant). The probability amplitude is still spread all over everywhere. There is not the slightest trace of anything like a localized world line, x = x ( t ) x = x ( t ) x=x(t)x=x(t)x=x(t).
8. To localize the particle, build a probabilityamplitude wave packet by superposing monofrequency (monoenergy) terms, according to a prescription qualitatively of the form
(4) ψ ( x , t ) = ψ E ( x , t ) + ψ E + Δ E ( x , t ) + (4) ψ ( x , t ) = ψ E ( x , t ) + ψ E + Δ E ( x , t ) + {:(4)psi(x","t)=psi_(E)(x","t)+psi_(E+Delta E)(x","t)+cdots*:}\begin{equation*} \psi(x, t)=\psi_{E}(x, t)+\psi_{E+\Delta E}(x, t)+\cdots \cdot \tag{4} \end{equation*}(4)ψ(x,t)=ψE(x,t)+ψE+ΔE(x,t)+
Superposition of monoenergy waves to give wave packet
Destructive interference takes place almost everywhere. The wave packet is concentrated in the region of constructive interference. There the phases of the various waves agree; thus
(5) S E ( x , t ) = S E + Δ E ( x , t ) (5) S E ( x , t ) = S E + Δ E ( x , t ) {:(5)S_(E)(x","t)=S_(E+Delta E)(x","t):}\begin{equation*} S_{E}(x, t)=S_{E+\Delta E}(x, t) \tag{5} \end{equation*}(5)SE(x,t)=SE+ΔE(x,t)
At last one has moved from a wave spread everywhere to a localized wave and thence, in the limit of indefinitely small wavelength, to a classical world line. This one equation of constructive interference ties together x x xxx and t t ttt (locus of world line in x , t x , t x,tx, tx,t, diagram). Smooth lines 20 , 19 , 18 20 , 19 , 18 -20,-19,-18-20,-19,-1820,19,18, etc. are wave crests of ψ E ψ E psi_(E)\psi_{E}ψE; dashed lines, wave crests for ψ E + Δ E ψ E + Δ E psi_(E+Delta E)\psi_{E+\Delta E}ψE+ΔE. Shaded area is region of constructive interference (wave packet). Black dots mark locus of classical world line,
Lim Δ E 0 S E + Δ E ( x , t ) S E ( x , t ) Δ E = 0 Lim Δ E 0 S E + Δ E ( x , t ) S E ( x , t ) Δ E = 0 Lim_(Delta E rarr0)(S_(E+Delta E)(x,t)-S_(E)(x,t))/(Delta E)=0\operatorname{Lim}_{\Delta E \rightarrow 0} \frac{S_{E+\Delta E}(x, t)-S_{E}(x, t)}{\Delta E}=0LimΔE0SE+ΔE(x,t)SE(x,t)ΔE=0

9. The Newtonian course of the world line through spacetime follows at once from this condition of constructive interference when one goes to the classical limit ( \hbar negligible compared to amounts of action involved; hence wavelength negligibly short; hence spread of energies Δ E Δ E Delta E\Delta EΔE required to build well-localized wave packet also negligible); thus
S E + Δ E ( x , t ) S E ( x , t ) Δ E = 0 S E + Δ E ( x , t ) S E ( x , t ) Δ E = 0 (S_(E+Delta E)(x,t)-S_(E)(x,t))/(Delta E)=0\frac{S_{E+\Delta E}(x, t)-S_{E}(x, t)}{\Delta E}=0SE+ΔE(x,t)SE(x,t)ΔE=0
reduces to
S E ( x , t ) E = 0 S E ( x , t ) E = 0 (delS_(E)(x,t))/(del E)=0\frac{\partial S_{E}(x, t)}{\partial E}=0SE(x,t)E=0

Box 25.3 (continued)

This condition in turn, applied to expression (3), gives the time required to travel to the point x x xxx; thus,
t + x 0 x d x { ( 2 / m ) [ E V ( x ) ] } 1 / 2 + t 0 = 0 t + x 0 x d x { ( 2 / m ) [ E V ( x ) ] } 1 / 2 + t 0 = 0 -t+int_(x_(0))^(x)(dx)/({(2//m)[E-V(x)]}^(1//2))+t_(0)=0-t+\int_{x_{0}}^{x} \frac{d x}{\{(2 / m)[E-V(x)]\}^{1 / 2}}+t_{0}=0t+x0xdx{(2/m)[EV(x)]}1/2+t0=0
where t 0 t 0 t_(0)t_{0}t0 is an abbreviation for the quantity
t 0 = d δ E / d E t 0 = d δ E / d E t_(0)=ddelta_(E)//dEt_{0}=d \delta_{E} / d Et0=dδE/dE
("difference in base value of dynamic phase per unit difference of energy").
10. Not one trace of the quantum of action comes into this final Newtonian result, for a simple reason: \hbar has been treated as negligible and the wavelength has been treated as negligible. In this limit the location of the wave "packet" reduces to the location of the wave crest. The location of the wave crest is precisely what is governed by S E ( x , t ) S E ( x , t ) S_(E)(x,t)S_{E}(x, t)SE(x,t); and the condition of "constructive interference" S E ( x , t ) / E = 0 S E ( x , t ) / E = 0 delS_(E)(x,t)//del E=0\partial S_{E}(x, t) / \partial E=0SE(x,t)/E=0 gives without approximation the location of the sharply defined Newtonian world line x = x ( t ) x = x ( t ) x=x(t)x=x(t)x=x(t).
11. Relevance in the context of motion in a central field of force? Quickest known route to the concept of effective potential (Box 25.4).
Box 25.4 MOTION UNDER GRAVITATIONAL ATTRACTION OF A CENTRAL MASS ANALYZED BY HAMILTON-JACOBI METHOD

A. Newtonian Theory of Gravitation

Hamiltonian
(1) H ~ = p ~ r 2 2 + p ~ θ 2 2 r 2 + p ~ ϕ 2 2 r 2 sin 2 θ M r (1) H ~ = p ~ r 2 2 + p ~ θ 2 2 r 2 + p ~ ϕ 2 2 r 2 sin 2 θ M r {:(1) widetilde(H)=( widetilde(p)_(r)^(2))/(2)+( widetilde(p)_(theta)^(2))/(2r^(2))+( widetilde(p)_(phi)^(2))/(2r^(2)sin^(2)theta)-(M)/(r):}\begin{equation*} \widetilde{H}=\frac{\widetilde{p}_{r}{ }^{2}}{2}+\frac{\widetilde{p}_{\theta}{ }^{2}}{2 r^{2}}+\frac{\widetilde{p}_{\phi}{ }^{2}}{2 r^{2} \sin ^{2} \theta}-\frac{M}{r} \tag{1} \end{equation*}(1)H~=p~r22+p~θ22r2+p~ϕ22r2sin2θMr
(tildes over energy, momentum, etc., refer to test object of unit mass; test particle of mass μ μ mu\muμ follows same motion with energy E = μ E ~ E = μ E ~ E=mu widetilde(E)E=\mu \widetilde{E}E=μE~, momentum p = μ p ~ p = μ p ~ p=mu widetilde(p)\boldsymbol{p}=\mu \widetilde{\boldsymbol{p}}p=μp~, etc.).
Equation of Hamilton-Jacobi for propagation of wave crests:
(2) S ~ t = 1 2 ( S ~ r ) 2 + 1 2 r 2 ( S ~ θ ) 2 + 1 2 r 2 sin 2 θ ( S ~ ϕ ) 2 M r . (2) S ~ t = 1 2 S ~ r 2 + 1 2 r 2 S ~ θ 2 + 1 2 r 2 sin 2 θ S ~ ϕ 2 M r . {:(2)-(del( widetilde(S)))/(del t)=(1)/(2)((del( widetilde(S)))/(del r))^(2)+(1)/(2r^(2))((del( widetilde(S)))/(del theta))^(2)+(1)/(2r^(2)sin^(2)theta)((del( widetilde(S)))/(del phi))^(2)-(M)/(r).:}\begin{equation*} -\frac{\partial \widetilde{S}}{\partial t}=\frac{1}{2}\left(\frac{\partial \widetilde{S}}{\partial r}\right)^{2}+\frac{1}{2 r^{2}}\left(\frac{\partial \widetilde{S}}{\partial \theta}\right)^{2}+\frac{1}{2 r^{2} \sin ^{2} \theta}\left(\frac{\partial \widetilde{S}}{\partial \phi}\right)^{2}-\frac{M}{r} . \tag{2} \end{equation*}(2)S~t=12(S~r)2+12r2(S~θ)2+12r2sin2θ(S~ϕ)2Mr.

Box 25.4 (continued)

Solve by "method of separation of variables" with convention that a 2 ± a a 2 ± a sqrt(a^(2))-=+-a\sqrt{a^{2}} \equiv \pm aa2±a,
S ~ = E ~ t + p ~ ϕ ϕ + θ ( L ~ 2 p ~ ϕ 2 sin 2 θ ) 1 / 2 d θ (3) + r [ 2 ( E ~ + M r L ~ 2 2 r 2 ) ] 1 / 2 d r + δ p ~ ϕ , , ~ , E ~ . S ~ = E ~ t + p ~ ϕ ϕ + θ L ~ 2 p ~ ϕ 2 sin 2 θ 1 / 2 d θ (3) + r 2 E ~ + M r L ~ 2 2 r 2 1 / 2 d r + δ p ~ ϕ , , ~ , E ~ . {:[ widetilde(S)=- widetilde(E)t+ widetilde(p)_(phi)phi+int^(theta)( widetilde(L)^(2)-( widetilde(p)_(phi)^(2))/(sin^(2)theta))^(1//2)d theta],[(3)+int^(r)[2(( widetilde(E))+(M)/(r)-( widetilde(L)^(2))/(2r^(2)))]^(1//2)dr+delta_( tilde(p)_(phi), tilde(,), tilde(E)).]:}\begin{align*} \widetilde{S}= & -\widetilde{E} t+\widetilde{p}_{\phi} \phi+\int^{\theta}\left(\widetilde{L}^{2}-\frac{\widetilde{p}_{\phi}{ }^{2}}{\sin ^{2} \theta}\right)^{1 / 2} d \theta \\ & +\int^{r}\left[2\left(\widetilde{E}+\frac{M}{r}-\frac{\widetilde{L}^{2}}{2 r^{2}}\right)\right]^{1 / 2} d r+\delta_{\tilde{p}_{\phi}, \tilde{,}, \tilde{E}} . \tag{3} \end{align*}S~=E~t+p~ϕϕ+θ(L~2p~ϕ2sin2θ)1/2dθ(3)+r[2(E~+MrL~22r2)]1/2dr+δp~ϕ,,~,E~.
(Check by substituting into Hamilton-Jacobi equation. Solution as sum of four terms corresponding to the four independent variables goes hand in hand with expression of probability amplitude in quantum mechanics as product of four factors, because i S / = i μ S ~ / i S / = i μ S ~ / iS//ℏ=i mu widetilde(S)//ℏi S / \hbar=i \mu \widetilde{S} / \hbariS/=iμS~/ is exponent in approximate expression for the probability amplitude.)
Constructive interference of waves:
(1) with slightly different E ~ E ~ widetilde(E)\widetilde{E}E~ values (impose "condition of constructive interference" S ~ p ~ φ L ~ , E ~ ( t , r , θ , ϕ ) / E ~ = 0 ) S ~ p ~ φ L ~ , E ~ ( t , r , θ , ϕ ) / E ~ = 0 {: del widetilde(S)_( tilde(p)_(varphi))( tilde(L)),( tilde(E))(t,r,theta,phi)//del( widetilde(E))=0)\left.\partial \widetilde{S}_{\tilde{p}_{\varphi}} \tilde{L}, \tilde{E}(t, r, \theta, \phi) / \partial \widetilde{E}=0\right)S~p~φL~,E~(t,r,θ,ϕ)/E~=0) tells when the particle arrives at a given r r rrr (that is, gives relation between t t ttt and r r rrr );
(2) with slightly different values of the "parameter of total angular momentum per unit mass," L ~ L ~ widetilde(L)\widetilde{L}L~ (impose condition of constructive interference S ~ p ~ ϕ , L ~ , E ~ ( t , r , θ , ϕ ) / L ~ = 0 S ~ p ~ ϕ , L ~ , E ~ ( t , r , θ , ϕ ) / L ~ = 0 del widetilde(S)_( tilde(p)_(phi), tilde(L), tilde(E))(t,r,theta,phi)//del widetilde(L)=0\partial \widetilde{S}_{\tilde{p}_{\phi}, \tilde{L}, \tilde{E}}(t, r, \theta, \phi) / \partial \widetilde{L}=0S~p~ϕ,L~,E~(t,r,θ,ϕ)/L~=0 ) tells correlation between θ θ theta\thetaθ and r r rrr (a major feature of the shape of the orbit);
(3) with slightly different values of the "parameter of azimuthal angular momentum per unit mass," p ~ ϕ p ~ ϕ widetilde(p)_(phi)\widetilde{p}_{\phi}p~ϕ (impose condition S ~ / p ~ ϕ = 0 S ~ / p ~ ϕ = 0 del widetilde(S)//del widetilde(p)_(phi)=0\partial \widetilde{S} / \partial \widetilde{p}_{\phi}=0S~/p~ϕ=0 ) gives correlation between θ θ theta\thetaθ and ϕ ϕ phi\phiϕ,
(4) 0 = S ~ p ~ ϕ = ϕ θ ( p ~ ϕ / L ~ ) d θ sin θ ( sin 2 θ p ~ ϕ 2 / L ~ 2 ) 1 / 2 (4) 0 = S ~ p ~ ϕ = ϕ θ p ~ ϕ / L ~ d θ sin θ sin 2 θ p ~ ϕ 2 / L ~ 2 1 / 2 {:(4)0=(del( widetilde(S)))/(del widetilde(p)_(phi))=phi-int^(theta)(( widetilde(p)_(phi)//( widetilde(L)))d theta)/(sin theta(sin^(2)theta- widetilde(p)_(phi)^(2)// widetilde(L)^(2))^(1//2)):}\begin{equation*} 0=\frac{\partial \widetilde{S}}{\partial \widetilde{p}_{\phi}}=\phi-\int^{\theta} \frac{\left(\widetilde{p}_{\phi} / \widetilde{L}\right) d \theta}{\sin \theta\left(\sin ^{2} \theta-\widetilde{p}_{\phi}^{2} / \widetilde{L}^{2}\right)^{1 / 2}} \tag{4} \end{equation*}(4)0=S~p~ϕ=ϕθ(p~ϕ/L~)dθsinθ(sin2θp~ϕ2/L~2)1/2
Planar character of the orbit.
Puzzle out the value of this last integral with the help of a table of integrals? It is quicker and clearer to capture the content without calculation: the particle moves in a plane. The vector associated with the angular momentum L ~ L ~ widetilde(L)\widetilde{L}L~ stands perpendicular to this plane. The projection of this angular momentum along the z z zzz-axis is p ~ ϕ = L ~ cos α p ~ ϕ = L ~ cos α widetilde(p)_(phi)= widetilde(L)cos alpha\widetilde{p}_{\phi}=\widetilde{L} \cos \alphap~ϕ=L~cosα (definition of orbital inclination, α α alpha\alphaα ). Straight line connecting origin with particle cuts unit sphere in a point P P P\mathscr{P}P. As time runs on, P P P\mathscr{P}P traces out a great circle on the unit sphere. The plane of this great circle cuts the equatorial plane in a "line of nodes," at which "hinge-line" the two planes are separated by a dihedral angle, α α alpha\alphaα. The orbit of the point P P P\mathscr{P}P is described by x ^ = r cos ψ , y ^ = r sin ψ , z ^ = 0 x ^ = r cos ψ , y ^ = r sin ψ , z ^ = 0 hat(x)=r cos psi, hat(y)=r sin psi, hat(z)=0\hat{x}=r \cos \psi, \hat{y}=r \sin \psi, \hat{z}=0x^=rcosψ,y^=rsinψ,z^=0 in a Cartesian system of coordinates in which y ^ y ^ hat(y)\hat{y}y^ runs along the line of nodes and in which x ^ x ^ hat(x)\hat{x}x^ lies in the plane of the orbit.

Box 25.4 (continued)

In a coordinate system in which y y yyy runs along the line of nodes and x x xxx lies in the plane of the equator, one has:
r cos θ = z r z cos α + x ^ sin α = r cos ψ sin α ; r sin θ cos ϕ = x r sin θ sin ϕ = y = z ^ sin α + x ^ cos α = r cos ψ cos α ; r sin ψ . r cos θ = z r z cos α + x ^ sin α = r cos ψ sin α ; r sin θ cos ϕ = x r sin θ sin ϕ = y = z ^ sin α + x ^ cos α = r cos ψ cos α ; r sin ψ . {:[r cos theta=z],[rz cos alpha+ hat(x)sin alpha=r cos psi sin alpha;],[r sin theta cos phi=x],[r sin theta sin phi=y= hat(z)sin alpha+ hat(x)cos alpha=r cos psi cos alpha;],[r sin psi.]:}\begin{aligned} & r \cos \theta=z \\ & r \operatorname{z} \cos \alpha+\hat{x} \sin \alpha=r \cos \psi \sin \alpha ; \\ & r \sin \theta \cos \phi=x \\ & r \sin \theta \sin \phi=y=\hat{z} \sin \alpha+\hat{x} \cos \alpha=r \cos \psi \cos \alpha ; \\ & r \sin \psi . \end{aligned}rcosθ=zrzcosα+x^sinα=rcosψsinα;rsinθcosϕ=xrsinθsinϕ=y=z^sinα+x^cosα=rcosψcosα;rsinψ.
Eliminate reference to the Cartesian coordinates and, by taking ratios, also eliminate reference to r r rrr. Thus find the equation of the great circle route in parametric form,
tan ϕ = tan ψ / cos α tan ϕ = tan ψ / cos α tan phi=tan psi//cos alpha\tan \phi=\tan \psi / \cos \alphatanϕ=tanψ/cosα
and
cos θ = cos ψ sin α . cos θ = cos ψ sin α . cos theta=cos psi sin alpha.\cos \theta=\cos \psi \sin \alpha .cosθ=cosψsinα.
Here increasing values of ψ ψ psi\psiψ spell out successive points on the great circle. Eliminate ψ ψ psi\psiψ via the relation
sec 2 ψ tan 2 ψ = 1 sec 2 ψ tan 2 ψ = 1 sec^(2)psi-tan^(2)psi=1\sec ^{2} \psi-\tan ^{2} \psi=1sec2ψtan2ψ=1
to find
sin 2 α cos 2 θ tan 2 ϕ cos 2 α = 1 sin 2 α cos 2 θ tan 2 ϕ cos 2 α = 1 (sin^(2)alpha)/(cos^(2)theta)-tan^(2)phicos^(2)alpha=1\frac{\sin ^{2} \alpha}{\cos ^{2} \theta}-\tan ^{2} \phi \cos ^{2} \alpha=1sin2αcos2θtan2ϕcos2α=1
or, more briefly,
(5) sec ϕ = tan α tan θ . (5) sec ϕ = tan α tan θ {:(5)sec phi=tan alpha tan theta". ":}\begin{equation*} \sec \phi=\tan \alpha \tan \theta \text {. } \tag{5} \end{equation*}(5)secϕ=tanαtanθ
One verifies that ϕ ϕ phi\phiϕ as calculated from (5) provides an integral of (4), thus confirming the physical argument just traced out. Moreover, the arbitrary constant of integration that comes from (4), left out for the sake of simplicity from (5), is easily inserted by replacing ϕ ϕ phi\phiϕ there by ϕ ϕ 0 ϕ ϕ 0 phi-phi_(0)\phi-\phi_{0}ϕϕ0 (rotation of line of nodes to a new azimuth). The kind of physics just done in tracing out the relation between θ θ theta\thetaθ and ϕ ϕ phi\phiϕ is evidently elementary solid geometry and nothing more. The same geometric relationships also show up, with no relativistic corrections whatsoever (how could there be any?!) for motion in Schwarzschild geometry. Therefore it is appropriate to drop this complication from attention here and hereafter. Let the particle move entirely in the direction of increasing θ θ theta\thetaθ, not at all in the direction of increasing ϕ ϕ phi\phiϕ; that is, let it move in an orbit of zero angular momentum p ~ ϕ p ~ ϕ widetilde(p)_(phi)\widetilde{p}_{\phi}p~ϕ (total angular momentum vector L ~ L ~ widetilde(L)\widetilde{L}L~ inclined at angle α = π / 2 α = π / 2 alpha=pi//2\alpha=\pi / 2α=π/2 to z z zzz-axis). Consequently the dynamic phase S S SSS (to be divided by \hbar to obtain phase of Schrödinger wave function when one turns from classical to quantum mechanics) becomes
(6) S ~ = E ~ t + L ~ θ + r [ 2 ( E ~ + M r L ~ 2 2 r 2 ) ] 1 / 2 d r + δ L ~ , E ~ (6) S ~ = E ~ t + L ~ θ + r 2 E ~ + M r L ~ 2 2 r 2 1 / 2 d r + δ L ~ , E ~ {:(6) widetilde(S)=- widetilde(E)t+ widetilde(L)theta+int^(r)[2(( widetilde(E))+(M)/(r)-( widetilde(L)^(2))/(2r^(2)))]^(1//2)dr+delta_( tilde(L), tilde(E)):}\begin{equation*} \widetilde{S}=-\widetilde{E} t+\widetilde{L} \theta+\int^{r}\left[2\left(\widetilde{E}+\frac{M}{r}-\frac{\widetilde{L}^{2}}{2 r^{2}}\right)\right]^{1 / 2} d r+\delta_{\tilde{L}, \tilde{E}} \tag{6} \end{equation*}(6)S~=E~t+L~θ+r[2(E~+MrL~22r2)]1/2dr+δL~,E~
Shape of orbit:
(7) 0 = S ~ L ~ = θ r L ~ d r / r 2 [ 2 ( E ~ + M / r L ~ 2 / 2 r 2 ) ] 1 / 2 (7) 0 = S ~ L ~ = θ r L ~ d r / r 2 2 E ~ + M / r L ~ 2 / 2 r 2 1 / 2 {:(7)0=(del( widetilde(S)))/(del( widetilde(L)))=theta-int^(r)(( widetilde(L))dr//r^(2))/([2(( widetilde(E))+M//r- widetilde(L)^(2)//2r^(2))]^(1//2)):}\begin{equation*} 0=\frac{\partial \widetilde{S}}{\partial \widetilde{L}}=\theta-\int^{r} \frac{\widetilde{L} d r / r^{2}}{\left[2\left(\widetilde{E}+M / r-\widetilde{L}^{2} / 2 r^{2}\right)\right]^{1 / 2}} \tag{7} \end{equation*}(7)0=S~L~=θrL~dr/r2[2(E~+M/rL~2/2r2)]1/2
whence
(8) r = L ~ 2 / M 1 + e cos θ . (8) r = L ~ 2 / M 1 + e cos θ . {:(8)r=( widetilde(L)^(2)//M)/(1+e cos theta).:}\begin{equation*} r=\frac{\widetilde{L}^{2} / M}{1+e \cos \theta} . \tag{8} \end{equation*}(8)r=L~2/M1+ecosθ.
Here e e eee is an abbreviation for the eccentricity of the orbit,
(9) e = ( 1 + 2 E L 2 ~ / M 2 ) 1 / 2 (9) e = 1 + 2 E L 2 ~ / M 2 1 / 2 {:(9)e=(1+2( widetilde(EL^(2)))//M^(2))^(1//2):}\begin{equation*} e=\left(1+2 \widetilde{E L^{2}} / M^{2}\right)^{1 / 2} \tag{9} \end{equation*}(9)e=(1+2EL2~/M2)1/2
(greater than 1 for positive E ~ E ~ widetilde(E)\widetilde{E}E~, hyperbolic orbit; equal to 1 for zero E ~ E ~ widetilde(E)\widetilde{E}E~, parabolic orbit; less than 1 for negative E ~ E ~ widetilde(E)\widetilde{E}E~, elliptic orbit). A constant of integration has been omitted from (8) for simplicity. To reinstall it, replace θ θ theta\thetaθ by θ θ 0 θ θ 0 theta-theta_(0)\theta-\theta_{0}θθ0 (rotation of direction of principal axis in the plane of the orbit). Other features of the orbit:
(10) ( semimajor axis of orbit when elliptic ) a = L ~ 2 / M 1 e 2 = M ( 2 E ~ ) ; ( semiminor axis of orbit when elliptic ) b = L ~ 2 / M ( 1 e 2 ) 1 / 2 = L ~ ( 2 E ~ ) 1 / 2 ; (10) (  semimajor axis of   orbit when elliptic  ) a = L ~ 2 / M 1 e 2 = M ( 2 E ~ ) ; (  semiminor axis of   orbit when elliptic  ) b = L ~ 2 / M 1 e 2 1 / 2 = L ~ ( 2 E ~ ) 1 / 2 ; {:(10){:[((" semimajor axis of ")/(" orbit when elliptic ")),a=( widetilde(L)^(2)//M)/(1-e^(2))=(M)/((-2( widetilde(E))));],[((" semiminor axis of ")/(" orbit when elliptic ")),b=( widetilde(L)^(2)//M)/((1-e^(2))^(1//2))=(( widetilde(L)))/((-2( widetilde(E)))^(1//2));]:}:}\begin{array}{lr} \binom{\text { semimajor axis of }}{\text { orbit when elliptic }} & a=\frac{\widetilde{L}^{2} / M}{1-e^{2}}=\frac{M}{(-2 \widetilde{E})} ; \tag{10}\\ \binom{\text { semiminor axis of }}{\text { orbit when elliptic }} & b=\frac{\widetilde{L}^{2} / M}{\left(1-e^{2}\right)^{1 / 2}}=\frac{\widetilde{L}}{(-2 \widetilde{E})^{1 / 2}} ; \end{array}(10)( semimajor axis of  orbit when elliptic )a=L~2/M1e2=M(2E~);( semiminor axis of  orbit when elliptic )b=L~2/M(1e2)1/2=L~(2E~)1/2;
(12) ( ( "impact parameter" for hyperbolic orbit, or "distance of closest approach in absence of deflection" ) b = (angular momentum per unit mass) ( linear momentum per unit mass) (12)  "impact parameter"   for hyperbolic orbit,   or "distance of closest   approach in   absence of deflection"  b =  (angular momentum per unit mass)  (  linear momentum per unit mass)  {:(12)(([" "impact parameter" "],[" for hyperbolic orbit, "],[" or "distance of closest "],[" approach in "],[" absence of deflection" "]:})quad b=(" (angular momentum per unit mass) ")/((" linear momentum per unit mass) "):}\left(\begin{array}{l} \left(\begin{array}{l} \text { "impact parameter" } \\ \text { for hyperbolic orbit, } \\ \text { or "distance of closest } \\ \text { approach in } \\ \text { absence of deflection" } \end{array}\right. \tag{12} \end{array}\right) \quad b=\frac{\text { (angular momentum per unit mass) }}{(\text { linear momentum per unit mass) }}(12)(( "impact parameter"  for hyperbolic orbit,  or "distance of closest  approach in  absence of deflection" )b= (angular momentum per unit mass) ( linear momentum per unit mass) 
Θ = π 2 arc cos ( 1 / e ) (14) = 2 arctan [ M / ( 2 E ~ ) 1 / 2 L ~ ] = 2 arctan [ M / 2 E ~ b ] ; Θ = π 2 arc cos ( 1 / e ) (14) = 2 arctan M / ( 2 E ~ ) 1 / 2 L ~ = 2 arctan [ M / 2 E ~ b ] ; {:[Theta=pi-2arc cos(1//e)],[(14)=2arctan[M//(2( widetilde(E)))^(1//2)( widetilde(L))]],[=2arctan[M//2 widetilde(E)b];]:}\begin{align*} \Theta & =\pi-2 \operatorname{arc} \cos (1 / e) \\ & =2 \arctan \left[M /(2 \widetilde{E})^{1 / 2} \widetilde{L}\right] \tag{14}\\ & =2 \arctan [M / 2 \widetilde{E} b] ; \end{align*}Θ=π2arccos(1/e)(14)=2arctan[M/(2E~)1/2L~]=2arctan[M/2E~b];
(13) ( actual distance of closest approach ) r min = L ~ 2 / M ( 1 + 2 E L 2 ~ / M 2 ) 1 / 2 + 1 (13) (  actual distance of   closest approach  ) r min = L ~ 2 / M 1 + 2 E L 2 ~ / M 2 1 / 2 + 1 {:(13)((" actual distance of ")/(" closest approach "))quadr_(min)=( widetilde(L)^(2)//M)/((1+2( widetilde(EL^(2)))//M^(2))^(1//2)+1):}\begin{equation*} \binom{\text { actual distance of }}{\text { closest approach }} \quad r_{\min }=\frac{\widetilde{L}^{2} / M}{\left(1+2 \widetilde{E L^{2}} / M^{2}\right)^{1 / 2}+1} \tag{13} \end{equation*}(13)( actual distance of  closest approach )rmin=L~2/M(1+2EL2~/M2)1/2+1
( angle of deflection in hyperbolic orbit ) (  angle of deflection   in hyperbolic orbit  ) ((" angle of deflection ")/(" in hyperbolic orbit "))\binom{\text { angle of deflection }}{\text { in hyperbolic orbit }}( angle of deflection  in hyperbolic orbit )
( differential scattering cross section ) d σ d Ω = 2 π b d b 2 π sin Θ d Θ (  differential scattering   cross section  ) d σ d Ω = 2 π b d b 2 π sin Θ d Θ ((" differential scattering ")/(" cross section "))quad(d sigma)/(d Omega)=(2pi bdb)/(2pi sin Theta d Theta)\binom{\text { differential scattering }}{\text { cross section }} \quad \frac{d \sigma}{d \Omega}=\frac{2 \pi b d b}{2 \pi \sin \Theta d \boldsymbol{\Theta}}( differential scattering  cross section )dσdΩ=2πbdb2πsinΘdΘ
(15) = M 2 ( 4 E ~ sin 2 Θ / 2 ) 2 (Rutherford). (15) = M 2 4 E ~ sin 2 Θ / 2 2  (Rutherford).  {:(15)=(M^(2))/((4( widetilde(E))sin^(2)Theta//2)^(2))" (Rutherford). ":}\begin{equation*} =\frac{M^{2}}{\left(4 \widetilde{E} \sin ^{2} \Theta / 2\right)^{2}} \text { (Rutherford). } \tag{15} \end{equation*}(15)=M2(4E~sin2Θ/2)2 (Rutherford). 

Box 25.4 (continued)

Time as correlated with position:
(16) 0 = S ~ E ~ = t + r d r [ 2 ( E ~ + M r L ~ 2 2 r 2 ) ] 1 / 2 (16) 0 = S ~ E ~ = t + r d r 2 E ~ + M r L ~ 2 2 r 2 1 / 2 {:(16)0=(del( widetilde(S)))/(del( widetilde(E)))=-t+int^(r)(dr)/([2(( widetilde(E))+(M)/(r)-( widetilde(L)^(2))/(2r^(2)))]^(1//2)):}\begin{equation*} 0=\frac{\partial \widetilde{S}}{\partial \widetilde{E}}=-t+\int^{r} \frac{d r}{\left[2\left(\widetilde{E}+\frac{M}{r}-\frac{\widetilde{L}^{2}}{2 r^{2}}\right)\right]^{1 / 2}} \tag{16} \end{equation*}(16)0=S~E~=t+rdr[2(E~+MrL~22r2)]1/2
Write
(17) r = M ( 2 E ~ ) ( 1 e cos u ) (17) r = M ( 2 E ~ ) ( 1 e cos u ) {:(17)r=(M)/((-2( widetilde(E))))(1-e cos u):}\begin{equation*} r=\frac{M}{(-2 \widetilde{E})}(1-e \cos u) \tag{17} \end{equation*}(17)r=M(2E~)(1ecosu)
to simplify the integration. Get
(18) t = M ( 2 E ~ ) 3 / 2 ( u e sin u ) , (19) ( mean circular frequency ) = 2 π ( period ) = ω = ( 2 E ~ ) 3 / 2 M = ( M a 3 ) 1 / 2 . (18) t = M ( 2 E ~ ) 3 / 2 ( u e sin u ) , (19) (  mean circular   frequency  ) = 2 π (  period  ) = ω = ( 2 E ~ ) 3 / 2 M = M a 3 1 / 2 . {:[(18)t=(M)/((-2( widetilde(E)))^(3//2))(u-e sin u)","],[(19)((" mean circular ")/(" frequency "))=(2pi)/((" period "))=omega=((-2( widetilde(E)))^(3//2))/(M)=((M)/(a^(3)))^(1//2).]:}\begin{gather*} t=\frac{M}{(-2 \widetilde{E})^{3 / 2}}(u-e \sin u), \tag{18}\\ \binom{\text { mean circular }}{\text { frequency }}=\frac{2 \pi}{(\text { period })}=\omega=\frac{(-2 \widetilde{E})^{3 / 2}}{M}=\left(\frac{M}{a^{3}}\right)^{1 / 2} . \tag{19} \end{gather*}(18)t=M(2E~)3/2(uesinu),(19)( mean circular  frequency )=2π( period )=ω=(2E~)3/2M=(Ma3)1/2.
Here u u uuu is the so-called "mean eccentric anomaly" (Bessel's time parameter). In terms of this quantity, one has also:
sin u = ( 1 e 2 ) 1 / 2 sin θ 1 + e cos θ ; cos u = cos θ + e 1 + e cos θ ; cos θ = cos u e 1 e cos u ; sin θ = ( 1 e 2 ) 1 / 2 sin u 1 e cos u ; (20) x = r cos θ = M ( 2 E ~ ) ( cos u e ) ; (21) y = r sin θ = L ~ ( 2 E ~ ) 1 / 2 sin u . sin u = 1 e 2 1 / 2 sin θ 1 + e cos θ ; cos u = cos θ + e 1 + e cos θ ; cos θ = cos u e 1 e cos u ; sin θ = 1 e 2 1 / 2 sin u 1 e cos u ; (20) x = r cos θ = M ( 2 E ~ ) ( cos u e ) ; (21) y = r sin θ = L ~ ( 2 E ~ ) 1 / 2 sin u . {:[sin u=((1-e^(2))^(1//2)sin theta)/(1+e cos theta);],[cos u=(cos theta+e)/(1+e cos theta);],[cos theta=(cos u-e)/(1-e cos u);],[sin theta=((1-e^(2))^(1//2)sin u)/(1-e cos u);],[(20)x=r cos theta=(M)/((-2( widetilde(E))))(cos u-e);],[(21)y=r sin theta=(( widetilde(L)))/((-2( widetilde(E)))^(1//2))sin u.]:}\begin{gather*} \sin u=\frac{\left(1-e^{2}\right)^{1 / 2} \sin \theta}{1+e \cos \theta} ; \\ \cos u=\frac{\cos \theta+e}{1+e \cos \theta} ; \\ \cos \theta=\frac{\cos u-e}{1-e \cos u} ; \\ \sin \theta=\frac{\left(1-e^{2}\right)^{1 / 2} \sin u}{1-e \cos u} ; \\ x=r \cos \theta=\frac{M}{(-2 \widetilde{E})}(\cos u-e) ; \tag{20}\\ y=r \sin \theta=\frac{\widetilde{L}}{(-2 \widetilde{E})^{1 / 2}} \sin u . \tag{21} \end{gather*}sinu=(1e2)1/2sinθ1+ecosθ;cosu=cosθ+e1+ecosθ;cosθ=cosue1ecosu;sinθ=(1e2)1/2sinu1ecosu;(20)x=rcosθ=M(2E~)(cosue);(21)y=rsinθ=L~(2E~)1/2sinu.
These expressions lend themselves to Fourier analysis into harmonic functions of the time, with coefficients that are standard Bessel functions:
(22) J n ( z ) = 1 2 π π π e i ( z sin u n u ) d u (23) x / a = 3 2 e + k = k 0 + k 1 J k 1 ( k e ) cos k ω t ; (24) y / a = ( 1 e 2 ) 1 / 2 k = k 0 + k 1 J k 1 ( k e ) sin k ω t (22) J n ( z ) = 1 2 π π π e i ( z sin u n u ) d u (23) x / a = 3 2 e + k = k 0 + k 1 J k 1 ( k e ) cos k ω t ; (24) y / a = 1 e 2 1 / 2 k = k 0 + k 1 J k 1 ( k e ) sin k ω t {:[(22)J_(n)(z)=(1)/(2pi)int_(-pi)^(pi)e^(i(z sin u-nu))du],[(23)x//a=-(3)/(2)e+sum_({:[k=-oo],[k!=0]:})^(+oo)k^(-1)J_(k-1)(ke)cos k omega t;],[(24)y//a=(1-e^(2))^(1//2)sum_({:[k=-oo],[k!=0]:})^(+oo)k^(-1)J_(k-1)(ke)sin k omega t]:}\begin{gather*} J_{n}(z)=\frac{1}{2 \pi} \int_{-\pi}^{\pi} e^{i(z \sin u-n u)} d u \tag{22}\\ x / a=-\frac{3}{2} e+\sum_{\substack{k=-\infty \\ k \neq 0}}^{+\infty} k^{-1} J_{k-1}(k e) \cos k \omega t ; \tag{23}\\ y / a=\left(1-e^{2}\right)^{1 / 2} \sum_{\substack{k=-\infty \\ k \neq 0}}^{+\infty} k^{-1} J_{k-1}(k e) \sin k \omega t \tag{24} \end{gather*}(22)Jn(z)=12πππei(zsinunu)du(23)x/a=32e+k=k0+k1Jk1(ke)coskωt;(24)y/a=(1e2)1/2k=k0+k1Jk1(ke)sinkωt
[for these and further formulas of this type, see, for example, Wintner (1941), Siegel (1956), and Siegel and Moser (1971)]. Via such Fourier analysis one is in a position to calculate the intensity of gravitational radiation emitted at the fundamental circular frequency ω ω omega\omegaω and at the overtone frequencies (see Chapter 36 ).

B. Einstein's Geometric Theory of Gravitation

Connection between energy and momentum for a test particle of rest mass μ μ mu\muμ traveling in curved space,
(25) g α β p α p β + μ 2 = 0 . (25) g α β p α p β + μ 2 = 0 . {:(25)g^(alpha beta)p_(alpha)p_(beta)+mu^(2)=0.:}\begin{equation*} g^{\alpha \beta} p_{\alpha} p_{\beta}+\mu^{2}=0 . \tag{25} \end{equation*}(25)gαβpαpβ+μ2=0.
Gravitation shows up in no way other than in curvature of the geometry, in which the particle moves as free of all "real" force. Refer all quantities to basis of a test object of unit rest mass by dealing throughout with p ~ = p / μ p ~ = p / μ widetilde(p)=p//mu\widetilde{\boldsymbol{p}}=\boldsymbol{p} / \mup~=p/μ. Also write p ~ α = S ~ / x α p ~ α = S ~ / x α widetilde(p)_(alpha)=del widetilde(S)//delx^(alpha)\widetilde{p}_{\alpha}=\partial \widetilde{S} / \partial x^{\alpha}p~α=S~/xα. Thus Hamilton-Jacobi equation for propagation of wave crests in Schwarzschild geometry (external field of a star; § 23.6 § 23.6 §23.6\S 23.6§23.6 ) becomes
(26) 1 ( 1 2 M / r ) ( S ~ t ) 2 + ( 1 2 M / r ) ( S ~ r ) 2 + 1 r 2 ( S ~ θ ) 2 + 1 r 2 sin 2 θ ( S ~ ϕ ) 2 + 1 = 0 (26) 1 ( 1 2 M / r ) S ~ t 2 + ( 1 2 M / r ) S ~ r 2 + 1 r 2 S ~ θ 2 + 1 r 2 sin 2 θ S ~ ϕ 2 + 1 = 0 {:(26){:[-(1)/((1-2M//r))((del( widetilde(S)))/(del t))^(2)+(1-2M//r)((del( widetilde(S)))/(del r))^(2)+(1)/(r^(2))((del( widetilde(S)))/(del theta))^(2)+(1)/(r^(2)sin^(2)theta)((del( widetilde(S)))/(del phi))^(2)],[+1=0]:}:}\begin{array}{r} -\frac{1}{(1-2 M / r)}\left(\frac{\partial \widetilde{S}}{\partial t}\right)^{2}+(1-2 M / r)\left(\frac{\partial \widetilde{S}}{\partial r}\right)^{2}+\frac{1}{r^{2}}\left(\frac{\partial \widetilde{S}}{\partial \theta}\right)^{2}+\frac{1}{r^{2} \sin ^{2} \theta}\left(\frac{\partial \widetilde{S}}{\partial \phi}\right)^{2} \\ +1=0 \tag{26} \end{array}(26)1(12M/r)(S~t)2+(12M/r)(S~r)2+1r2(S~θ)2+1r2sin2θ(S~ϕ)2+1=0
Solve Hamilton-Jacobi equation. As in Newtonian problem, simplify by eliminating all motion in direction of increasing ϕ ϕ phi\phiϕ. Thus set 0 = p ~ ϕ = S ~ / ϕ 0 = p ~ ϕ = S ~ / ϕ 0= widetilde(p)_(phi)=del widetilde(S)//del phi0=\widetilde{p}_{\phi}=\partial \widetilde{S} / \partial \phi0=p~ϕ=S~/ϕ (dynamic phase independent of ϕ ϕ phi\phiϕ ) and have
(27) S ~ = E ~ t + L ~ θ + r [ E ~ 2 ( 1 2 M / r ) ( 1 + L ~ 2 / r 2 ) ] 1 / 2 d r ( 1 2 M / r ) (27) S ~ = E ~ t + L ~ θ + r E ~ 2 ( 1 2 M / r ) 1 + L ~ 2 / r 2 1 / 2 d r ( 1 2 M / r ) {:(27) widetilde(S)=- widetilde(E)t+ widetilde(L)theta+int^(r)[ widetilde(E)^(2)-(1-2M//r)(1+ widetilde(L)^(2)//r^(2))]^(1//2)(dr)/((1-2M//r)):}\begin{equation*} \widetilde{S}=-\widetilde{E} t+\widetilde{L} \theta+\int^{r}\left[\widetilde{E}^{2}-(1-2 M / r)\left(1+\widetilde{L}^{2} / r^{2}\right)\right]^{1 / 2} \frac{d r}{(1-2 M / r)} \tag{27} \end{equation*}(27)S~=E~t+L~θ+r[E~2(12M/r)(1+L~2/r2)]1/2dr(12M/r)
Find shape of orbit by "principle of constructive interference"; thus,
(28) 0 = S ~ L ~ = θ r L ~ d r / r 2 [ E ~ 2 ( 1 2 M / r ) ( 1 + L ~ 2 / r 2 ) ] 1 / 2 (28) 0 = S ~ L ~ = θ r L ~ d r / r 2 E ~ 2 ( 1 2 M / r ) 1 + L ~ 2 / r 2 1 / 2 {:(28)0=(del( widetilde(S)))/(del( widetilde(L)))=theta-int^(r)(( widetilde(L))dr//r^(2))/([ widetilde(E)^(2)-(1-2M//r)(1+ widetilde(L)^(2)//r^(2))]^(1//2)):}\begin{equation*} 0=\frac{\partial \widetilde{S}}{\partial \widetilde{L}}=\theta-\int^{r} \frac{\widetilde{L} d r / r^{2}}{\left[\widetilde{E}^{2}-(1-2 M / r)\left(1+\widetilde{L}^{2} / r^{2}\right)\right]^{1 / 2}} \tag{28} \end{equation*}(28)0=S~L~=θrL~dr/r2[E~2(12M/r)(1+L~2/r2)]1/2
[See equation (25.41) and associated discussion in text; also Figure 25.6.]
Find time to get to given r r rrr by considering "interference of wave crests" belonging to slightly different E ~ E ~ widetilde(E)\widetilde{E}E~ values:
(29) 0 = S ~ E ~ = t + r E ~ [ E ~ 2 ( 1 2 M / r ) ( 1 + L ~ 2 / r 2 ) ] 1 / 2 d r ( 1 2 M / r ) (29) 0 = S ~ E ~ = t + r E ~ E ~ 2 ( 1 2 M / r ) 1 + L ~ 2 / r 2 1 / 2 d r ( 1 2 M / r ) {:(29)0=(del( widetilde(S)))/(del( widetilde(E)))=-t+int^(r)(( widetilde(E)))/([ widetilde(E)^(2)-(1-2M//r)(1+ widetilde(L)^(2)//r^(2))]^(1//2))(dr)/((1-2M//r)):}\begin{equation*} 0=\frac{\partial \widetilde{S}}{\partial \widetilde{E}}=-t+\int^{r} \frac{\widetilde{E}}{\left[\widetilde{E}^{2}-(1-2 M / r)\left(1+\widetilde{L}^{2} / r^{2}\right)\right]^{1 / 2}} \frac{d r}{(1-2 M / r)} \tag{29} \end{equation*}(29)0=S~E~=t+rE~[E~2(12M/r)(1+L~2/r2)]1/2dr(12M/r)
[See equation (25.32) and associated discussion in text; also Figure 25.5 and exercise 25.15.]

§25.2. SYMMETRIES AND CONSERVATION LAWS

From symmetries to conservation laws by:
(1) Lagrangian or Hamiltonian approach
(2) Killing-vector approach
Killing vector, ξ ξ xi\xiξ, defined
In analytic mechanics, one knows that symmetries of a Lagrangian or Hamiltonian result in conservation laws. Exercises 25.1 to 25.4 describe how these general principles are used to deduce, from the symmetries of Schwarzschild spacetime, constants of motion for the trajectories (geodesics) of freely falling particles in the gravitational field outside a star. The same constants of motion are obtained in a different language in differential geometry, where a "Killing vector" is the standard tool for the description of symmetry. This section considers the general question of metric symmetries before proceeding to Schwarzschild spacetime.
Let the metric components g μ ν g μ ν g_(mu nu)g_{\mu \nu}gμν relative to some coordinate basis d x α d x α dx^(alpha)\boldsymbol{d} x^{\alpha}dxα be independent of one of the coordinates x K x K x^(K)x^{K}xK, so
(25.1) g μ ν / x α = 0 for α = K (25.1) g μ ν / x α = 0  for  α = K {:(25.1)delg_(mu nu)//delx^(alpha)=0" for "alpha=K:}\begin{equation*} \partial g_{\mu \nu} / \partial x^{\alpha}=0 \text { for } \alpha=K \tag{25.1} \end{equation*}(25.1)gμν/xα=0 for α=K
This relation possesses a geometric interpretation. Any curve x α = c α ( λ ) x α = c α ( λ ) x^(alpha)=c^(alpha)(lambda)x^{\alpha}=c^{\alpha}(\lambda)xα=cα(λ) can be translated in the x K x K x^(K)x^{K}xK direction by the coordinate shift Δ x K = ε Δ x K = ε Deltax^(K)=epsi\Delta x^{K}=\varepsilonΔxK=ε to form a "congruent" (equivalent) curve:
x α = c α ( λ ) for α K and x K = c K ( λ ) + ε . x α = c α ( λ )  for  α K  and  x K = c K ( λ ) + ε . x^(alpha)=c^(alpha)(lambda)" for "alpha!=K" and "x^(K)=c^(K)(lambda)+epsi.x^{\alpha}=c^{\alpha}(\lambda) \text { for } \alpha \neq K \text { and } x^{K}=c^{K}(\lambda)+\varepsilon .xα=cα(λ) for αK and xK=cK(λ)+ε.
Let the original curve run from λ = λ 1 λ = λ 1 lambda=lambda_(1)\lambda=\lambda_{1}λ=λ1 to λ = λ 2 λ = λ 2 lambda=lambda_(2)\lambda=\lambda_{2}λ=λ2 and have length
L = λ 1 λ 2 [ g μ ν ( x ( λ ) ) ( d x μ / d λ ) ( d x ν / d λ ) ] 1 / 2 d λ . L = λ 1 λ 2 g μ ν ( x ( λ ) ) d x μ / d λ d x ν / d λ 1 / 2 d λ . L=int_(lambda_(1))^(lambda_(2))[g_(mu nu)(x(lambda))(dx^(mu)//d lambda)(dx^(nu)//d lambda)]^(1//2)d lambda.L=\int_{\lambda_{1}}^{\lambda_{2}}\left[g_{\mu \nu}(x(\lambda))\left(d x^{\mu} / d \lambda\right)\left(d x^{\nu} / d \lambda\right)\right]^{1 / 2} d \lambda .L=λ1λ2[gμν(x(λ))(dxμ/dλ)(dxν/dλ)]1/2dλ.
Then the displaced curve has length
L ( ε ) = λ 1 λ 2 [ { g μ ν ( x ( λ ) ) + ε g μ ν x K } ( d x μ / d λ ) ( d x ν / d λ ) ] 1 / 2 d λ . L ( ε ) = λ 1 λ 2 g μ ν ( x ( λ ) ) + ε g μ ν x K d x μ / d λ d x ν / d λ 1 / 2 d λ . L(epsi)=int_(lambda_(1))^(lambda_(2))[{g_(mu nu)(x(lambda))+epsi(delg_(mu nu))/(delx^(K))}(dx^(mu)//d lambda)(dx^(nu)//d lambda)]^(1//2)d lambda.L(\varepsilon)=\int_{\lambda_{1}}^{\lambda_{2}}\left[\left\{g_{\mu \nu}(x(\lambda))+\varepsilon \frac{\partial g_{\mu \nu}}{\partial x^{K}}\right\}\left(d x^{\mu} / d \lambda\right)\left(d x^{\nu} / d \lambda\right)\right]^{1 / 2} d \lambda .L(ε)=λ1λ2[{gμν(x(λ))+εgμνxK}(dxμ/dλ)(dxν/dλ)]1/2dλ.
But the coefficient of ε ε epsi\varepsilonε in the integrand is zero. Therefore the length of the new curve is identical to the length of the original curve: d L / d ε = 0 d L / d ε = 0 dL//d epsi=0d L / d \varepsilon=0dL/dε=0.
The vector
(25.2) ξ d / d ε = ( / x K ) (25.2) ξ d / d ε = / x K {:(25.2)xi-=d//d epsi=(del//delx^(K)):}\begin{equation*} \xi \equiv d / d \varepsilon=\left(\partial / \partial x^{K}\right) \tag{25.2} \end{equation*}(25.2)ξd/dε=(/xK)
provides an infinitesimal description of these length-preserving "translations." It is called a "Killing vector." It satisfies Killing's equation*
(25.3) ξ μ ; p + ξ p ; μ = 0 (25.3) ξ μ ; p + ξ p ; μ = 0 {:(25.3)xi_(mu;p)+xi_(p;mu)=0:}\begin{equation*} \xi_{\mu ; p}+\xi_{p ; \mu}=0 \tag{25.3} \end{equation*}(25.3)ξμ;p+ξp;μ=0
(condition on the vector field ξ ξ xi\xiξ necessary and sufficient to ensure that all lengths are preserved by the displacement ε ξ ε ξ epsi xi\varepsilon \xiεξ ). This condition is expressed in covariant form.
Therefore it is enough to establish it in the preferred coordinate system of (25.1) in order to have it hold in every coordinate system. In that preferred coordinate system, the vector field, according to (25.2), has components
ξ μ = δ μ K ξ μ = δ μ K xi^(mu)=delta^(mu)_(K)\xi^{\mu}=\delta^{\mu}{ }_{K}ξμ=δμK
Therefore the covariant derivative of this vector field has components
ξ μ ; ν = g μ α ξ ; ν α = g μ α ( ξ α x ν + Γ ν σ α ξ σ ) (25.4) = g μ α Γ ν K α = Γ μ ν K = 1 2 ( g μ K x ν + g μ ν x K g ν K x μ ) = 1 2 ( g μ K , ν g ν K , μ ) . ξ μ ; ν = g μ α ξ ; ν α = g μ α ξ α x ν + Γ ν σ α ξ σ (25.4) = g μ α Γ ν K α = Γ μ ν K = 1 2 g μ K x ν + g μ ν x K g ν K x μ = 1 2 g μ K , ν g ν K , μ . {:[xi_(mu;nu)=g_(mu alpha)xi_(;nu)^(alpha)=g_(mu alpha)((delxi^(alpha))/(delx^(nu))+Gamma_(nu sigma)^(alpha)xi^(sigma))],[(25.4)=g_(mu alpha)Gamma_(nu K)^(alpha)=Gamma_(mu nu K)=(1)/(2)((delg_(mu K))/(delx^(nu))+(delg_(mu nu))/(delx^(K))-(delg_(nu K))/(delx^(mu)))],[=(1)/(2)(g_(mu K,nu)-g_(nu K,mu)).]:}\begin{align*} \xi_{\mu ; \nu}=g_{\mu \alpha} \xi_{; \nu}^{\alpha} & =g_{\mu \alpha}\left(\frac{\partial \xi^{\alpha}}{\partial x^{\nu}}+\Gamma_{\nu \sigma}^{\alpha} \xi^{\sigma}\right) \\ & =g_{\mu \alpha} \Gamma_{\nu K}^{\alpha}=\Gamma_{\mu \nu K}=\frac{1}{2}\left(\frac{\partial g_{\mu K}}{\partial x^{\nu}}+\frac{\partial g_{\mu \nu}}{\partial x^{K}}-\frac{\partial g_{\nu K}}{\partial x^{\mu}}\right) \tag{25.4}\\ & =\frac{1}{2}\left(g_{\mu K, \nu}-g_{\nu K, \mu}\right) . \end{align*}ξμ;ν=gμαξ;να=gμα(ξαxν+Γνσαξσ)(25.4)=gμαΓνKα=ΓμνK=12(gμKxν+gμνxKgνKxμ)=12(gμK,νgνK,μ).
One sees that ξ μ ; ν ξ μ ; ν xi_(mu;nu)\xi_{\mu ; \nu}ξμ;ν is antisymmetric in the labels μ μ mu\muμ and ν ν nu\nuν, as stated in Killing's equation (25.3).
The geometric significance of a Killing vector is spelled out in Box 25.5 .
From Killing's equation, ξ ( μ ; p ) = 0 ξ ( μ ; p ) = 0 xi_((mu;p))=0\xi_{(\mu ; p)}=0ξ(μ;p)=0, and from the geodesic equation p p = 0 p p = 0 grad_(p)p=0\boldsymbol{\nabla}_{\boldsymbol{p}} \boldsymbol{p}=0pp=0 for the tangent vector p = d / d λ p = d / d λ p=d//d lambda\boldsymbol{p}=d / d \lambdap=d/dλ to any geodesic, follows an important theorem: In any geometry endowed with a symmetry described by a Killing vector field ξ ξ xi\xiξ, motion along any geodesic whatsoever leaves constant the scalar product of the tangent vector with the Killing vector:
(25.5) p K = p ξ = constant . (25.5) p K = p ξ =  constant  . {:(25.5)p_(K)=p*xi=" constant ".:}\begin{equation*} p_{K}=\boldsymbol{p} \cdot \xi=\text { constant } . \tag{25.5} \end{equation*}(25.5)pK=pξ= constant .
In verification of this result, evaluate the rate of change of the constant p K p K p_(K)p_{K}pK along the course of the typical geodesic (affine parameter λ λ lambda\lambdaλ; result therefore as applicable to light rays-with zero lapse of proper time-as to particles); thus,
(25.6) d p κ / d λ = ( p μ ξ μ ) ; ν p ν = ( p ; ν μ p ν ) ξ μ + p ( μ p ν ξ [ μ ; ν ] = 0 (25.6) d p κ / d λ = p μ ξ μ ; ν p ν = p ; ν μ p ν ξ μ + p ( μ p ν ξ [ μ ; ν ] = 0 {:(25.6)dp_(kappa)//d lambda=(p^(mu)xi_(mu))_(;nu)p^(nu)=(p_(;nu)^(mu)p^(nu))xi_(mu)+p^((mu)p^(nu)xi_([mu;nu])=0:}\begin{equation*} d p_{\kappa} / d \lambda=\left(p^{\mu} \xi_{\mu}\right)_{; \nu} p^{\nu}=\left(p_{; \nu}^{\mu} p^{\nu}\right) \xi_{\mu}+p^{(\mu} p^{\nu} \xi_{[\mu ; \nu]}=0 \tag{25.6} \end{equation*}(25.6)dpκ/dλ=(pμξμ);νpν=(p;νμpν)ξμ+p(μpνξ[μ;ν]=0
Turn back from a general coordinate system to the coordinates of (25.1), where the Killing vector field of the symmetry lets itself be written ξ = / x K ξ = / x K xi=del//delx^(K)\xi=\partial / \partial x^{K}ξ=/xK. Then the scalar product of (25.5) becomes constant p α ξ α = p α δ α K = p K p α ξ α = p α δ α K = p K -=p_(alpha)xi^(alpha)=p_(alpha)delta^(alpha)_(K)=p_(K)\equiv p_{\alpha} \xi^{\alpha}=p_{\alpha} \delta^{\alpha}{ }_{K}=p_{K}pαξα=pαδαK=pK. The symmetry of the geometry guarantees the conservation of the K K KKK-th covariant coordinate-based component of the momentum.
On a timelike geodesic in spacetime, the momentum of a test particle of mass μ μ mu\muμ is
(25.7) p d / d λ = μ u = μ d / d τ (25.7) p d / d λ = μ u = μ d / d τ {:(25.7)p-=d//d lambda=mu u=mu d//d tau:}\begin{equation*} \boldsymbol{p} \equiv d / d \lambda=\mu \boldsymbol{u}=\mu d / d \tau \tag{25.7} \end{equation*}(25.7)pd/dλ=μu=μd/dτ
Thus the affine parameter λ λ lambda\lambdaλ most usefully employed in the above analysis, when it is concerned with a particle, is not proper time τ τ tau\tauτ but rather the ratio λ = τ / μ λ = τ / μ lambda=tau//mu\lambda=\tau / \muλ=τ/μ.
When the metric g μ v g μ v g_(mu v)g_{\mu v}gμv is independent of a coordinate x K x K x^(K)x^{K}xK, that coordinate is called cyclic, and the corresponding conserved quantity, p K p K p_(K)p_{K}pK, is called the "momentum conjugate to that cyclic coordinate" in a terminology borrowed from nonrelativistic mechanics.
Conservation of p ξ p ξ p*xi\boldsymbol{p} \cdot \boldsymbol{\xi}pξ for geodesic motion
Terminology:
"cyclic coordinate," "conjugate momentum"

Box 25.5 KILLING VECTORS AND ISOMETRIES (IIlustrated by a Donut)

A. On a given manifold (e.g., spacetime, or the donut pictured here), in a given coordinate system, the metric components are independent of a particular coordinate x K x K x^(K)x^{K}xK. Example of donut:
d s 2 = b 2 d θ 2 + ( a b cos θ ) 2 d ϕ 2 g μ ν independent of x K = ϕ . d s 2 = b 2 d θ 2 + ( a b cos θ ) 2 d ϕ 2 g μ ν  independent of  x K = ϕ . {:[ds^(2)=b^(2)dtheta^(2)+(a-b cos theta)^(2)dphi^(2)],[g_(mu nu)" independent of "x^(K)=phi.]:}\begin{gathered} d s^{2}=b^{2} d \theta^{2}+(a-b \cos \theta)^{2} d \phi^{2} \\ g_{\mu \nu} \text { independent of } x^{K}=\phi . \end{gathered}ds2=b2dθ2+(abcosθ)2dϕ2gμν independent of xK=ϕ.

B. Translate an arbitrary curve C C C\mathcal{C}C through the infinitesimal displacement
ε ξ ε ( / x K ) = ε ( / ϕ ) , ε 1 ε ξ ε / x K = ε ( / ϕ ) , ε 1 epsi xi-=epsi(del//delx^(K))=epsi(del//del phi),quad epsi≪1\varepsilon \xi \equiv \varepsilon\left(\partial / \partial x^{K}\right)=\varepsilon(\partial / \partial \phi), \quad \varepsilon \ll 1εξε(/xK)=ε(/ϕ),ε1
to form a new curve C C C^(')\mathcal{C}^{\prime}C. In coordinate language C C C\mathcal{C}C is θ = θ ( λ ) , ϕ = ϕ ( λ ) θ = θ ( λ ) , ϕ = ϕ ( λ ) theta=theta(lambda),phi=phi(lambda)\theta=\theta(\lambda), \phi=\phi(\lambda)θ=θ(λ),ϕ=ϕ(λ); while C C C^(')\mathcal{C}^{\prime}C is θ = θ ( λ ) , ϕ = θ = θ ( λ ) , ϕ = theta=theta(lambda),phi=\theta=\theta(\lambda), \phi=θ=θ(λ),ϕ= ϕ ( λ ) + ε ϕ ( λ ) + ε phi(lambda)+epsi\phi(\lambda)+\varepsilonϕ(λ)+ε. (Translation of all points through Δ ϕ = ε Δ ϕ = ε Delta phi=epsi\Delta \phi=\varepsilonΔϕ=ε.) Because g μ ν / ϕ = 0 g μ ν / ϕ = 0 delg_(mu nu)//del phi=0\partial g_{\mu \nu} / \partial \phi=0gμν/ϕ=0, the curves C C C\mathcal{C}C and C C C^(')\mathcal{C}^{\prime}C have the same length (see text).
C. Pick a set of neighboring points a , B , C , D a , B , C , D a,B,C,Da, \mathscr{B}, \mathcal{C}, \mathscr{D}a,B,C,D; and translate each of them through ε ξ ε ξ epsi xi\varepsilon \xiεξ to obtain points a , B , C , Q a , B , C , Q a^('),B^('),C^('),Q^(')\mathscr{a}^{\prime}, \mathscr{B}^{\prime}, \mathcal{C}^{\prime}, \mathscr{Q}^{\prime}a,B,C,Q. Since the length of every curve is preserved by this translation, the distances between neighboring points are also preserved:
( (:}\left(\right.( distance between a a a^(')\mathscr{a}^{\prime}a and B ) = B = {:B^('))=\left.\mathscr{B}^{\prime}\right)=B)=
(distance between C C C\mathscr{C}C and B B B\mathscr{B}B ).
But geometry is equivalent to a table of all infinitesimal distances (see Box 13.1). Thus the geometry of the manifold is left completely unchanged by a translation of all points through ε ξ ε ξ epsi xi\varepsilon \xiεξ. [This is the coordinatefree version of the statement g μ ν / ϕ = 0 g μ ν / ϕ = 0 delg_(mu nu)//del phi=0\partial g_{\mu \nu} / \partial \phi=0gμν/ϕ=0.] One says that ξ = / ϕ ξ = / ϕ xi=del//del phi\xi=\partial / \partial \phiξ=/ϕ is the generator of an "isometry" (or "group of motions") on the manifold.
D. In general (see text), a vector field ξ ( P ) ξ ( P ) xi(P)\xi(\mathscr{P})ξ(P) generates an isometry if and only if it satisfies Killing's equation ξ ( α ; β ) = 0 ξ ( α ; β ) = 0 xi_((alpha;beta))=0\xi_{(\alpha ; \beta)}=0ξ(α;β)=0.
E. If ξ ( P ) ξ ( P ) xi(P)\boldsymbol{\xi}(\mathscr{P})ξ(P) generates an isometry (i.e. if ξ ξ xi\boldsymbol{\xi}ξ is a "Killing vector"), then the curves

to which ξ ξ xi\xiξ is tangent [ ξ = ( P / x K ) α 1 , , α n ] ξ = P / x K α 1 , , α n [xi=(delP//delx^(K))_(alpha_(1),dots,alpha_(n))]\left[\xi=\left(\partial \mathscr{P} / \partial x^{K}\right)_{\alpha_{1}, \ldots, \alpha_{n}}\right][ξ=(P/xK)α1,,αn] are called "trajectories of the isometry."
F. The geometry is invariant under a translation of all points of the manifold through the same Δ x K Δ x K Deltax^(K)\Delta x^{K}ΔxK along these trajectories [ P ( x K , α 1 , , α n ) P ( x K + P x K , α 1 , , α n P x K + [P(x^(K),alpha_(1),dots,alpha_(n))longrightarrowP(x^(K)+:}\left[\mathscr{P}\left(x^{K}, \alpha_{1}, \ldots, \alpha_{n}\right) \longrightarrow \mathscr{P}\left(x^{K}+\right.\right.[P(xK,α1,,αn)P(xK+ Δ x K , α 1 , , α n Δ x K , α 1 , , α n Deltax^(K),alpha_(1),dots,alpha_(n)\Delta x^{K}, \alpha_{1}, \ldots, \alpha_{n}ΔxK,α1,,αn ); "finite motion" built up from many "infinitesimal motions" ε ξ ε ξ epsi xi\varepsilon \xiεξ.]
G. This isometry is described in physical terms as follows. Station a family of observers throughout the manifold. Have each observer report to a central computer (1) all aspects of the manifold's geometry near him, and (2) the distances and directions to all neighboring observers (directions relative to "preferred" directions that are determined by anisotropies in the local geometry). Through each observer's position passes a unique trajectory of the isometry. Move each observer through the same fixed Δ x K Δ x K Deltax^(K)\Delta x^{K}ΔxK (e.g., Δ x K = 17 Δ x K = 17 Deltax^(K)=17\Delta x^{K}=17ΔxK=17 ) along his trajectory, leaving the manifold itself unchanged. Then have each observer report to the central computer the same geometric information as before his motion. The information received by the computer after the motion will be identical to that received before the motion. There is no way whatsoever, by geometric measurements, to discover that the motion has occurred! This is the significance of an isometry.
Three different trajectories on a donut
Parameter on trajectories is x K = ϕ x K = ϕ x^(K)=phix^{K}=\phixK=ϕ

EXERCISES

Exercise 25.1. CONSTANT OF MOTION OBTAINED FROM HAMILTON'S PRINCIPLE

Prove the above theorem of conservation of p K p ξ p K p ξ p_(K)-=p*xip_{K} \equiv \boldsymbol{p} \cdot \boldsymbol{\xi}pKpξ from Hamilton's principle (Box 13.3)
(25.8) δ 1 2 g μ ν ( x ) ( d x μ / d λ ) ( d x ν / d λ ) d λ = 0 (25.8) δ 1 2 g μ ν ( x ) d x μ / d λ d x ν / d λ d λ = 0 {:(25.8)delta int(1)/(2)g_(mu nu)(x)(dx^(mu)//d lambda)(dx^(nu)//d lambda)d lambda=0:}\begin{equation*} \delta \int \frac{1}{2} g_{\mu \nu}(x)\left(d x^{\mu} / d \lambda\right)\left(d x^{\nu} / d \lambda\right) d \lambda=0 \tag{25.8} \end{equation*}(25.8)δ12gμν(x)(dxμ/dλ)(dxν/dλ)dλ=0
as applied to geodesic paths. Recall: In this action principle, g μ g μ g_(mu)g_{\mu}gμ, is to be regarded as a known function of position, x x xxx, along the path; and the path itself, x μ ( λ ) x μ ( λ ) x^(mu)(lambda)x^{\mu}(\lambda)xμ(λ), is to be varied.

Exercise 25.2. SUPER-HAMILTONIAN FORMALISM FOR GEODESIC MOTION

Show that a set of differential equations in Hamiltonian form results from varying p μ p μ p_(mu)p_{\mu}pμ and x μ x μ x^(mu)x^{\mu}xμ independently in the variational principle δ I = 0 δ I = 0 delta I=0\delta I=0δI=0, where
(25.9) I = ( p μ d x μ H d λ ) (25.9) I = p μ d x μ H d λ {:(25.9)I=int(p_(mu)dx^(mu)-Hd lambda):}\begin{equation*} I=\int\left(p_{\mu} d x^{\mu}-\mathscr{H} d \lambda\right) \tag{25.9} \end{equation*}(25.9)I=(pμdxμHdλ)
and
(25.10) H 1 2 g μ ν ( x ) p μ p v (25.10) H 1 2 g μ ν ( x ) p μ p v {:(25.10)H-=(1)/(2)g^(mu nu)(x)p_(mu)p_(v):}\begin{equation*} \mathscr{H} \equiv \frac{1}{2} g^{\mu \nu}(x) p_{\mu} p_{v} \tag{25.10} \end{equation*}(25.10)H12gμν(x)pμpv
Show that the "super-Hamiltonian" H H H\mathscr{H}H is a constant of motion, and that the solutions of these equations are geodesics. What do the choices K = + 1 2 , K = 0 , K = 1 2 μ 2 K = + 1 2 , K = 0 , K = 1 2 μ 2 K=+(1)/(2),K=0,K=-(1)/(2)mu^(2)\mathscr{K}=+\frac{1}{2}, \mathscr{K}=0, \mathscr{K}=-\frac{1}{2} \mu^{2}K=+12,K=0,K=12μ2, or K = 1 2 K = 1 2 K=-(1)/(2)\mathscr{K}=-\frac{1}{2}K=12 mean for the geodesic and its parametrization?

Exercise 25.3. KILLING VECTORS IN FLAT SPACETIME

Find ten Killing vectors in flat Minkowski spacetime that are linearly independent. (Restrict attention to linear relationships with constant coefficients).

Exercise 25.4. POISSON BRACKET AS KEY TO CONSTANTS OF MOTION

If ξ ξ xi\boldsymbol{\xi}ξ is a Killing vector, show that p K ξ μ p μ p K ξ μ p μ p_(K)-=xi^(mu)p_(mu)p_{K} \equiv \xi^{\mu} p_{\mu}pKξμpμ commutes (has vanishing Poisson bracket) with the Hamiltonian K K K\mathscr{K}K of the previous problem, [ K , p K ] = 0 K , p K = 0 [K,p_(K)]=0\left[\mathscr{K}, p_{K}\right]=0[K,pK]=0, so d p κ / d λ = 0 d p κ / d λ = 0 dp_(kappa)//d lambda=0d p_{\kappa} / d \lambda=0dpκ/dλ=0. (Hint: Use a convenient coordinate system.)

Exercise 25.5. COMMUTATOR OF KILLING VECTORS IS A KILLING VECTOR

Consider two Killing vectors, ξ ξ xi\boldsymbol{\xi}ξ and η η eta\boldsymbol{\eta}η, which happen not to commute [as differential operators; i.e., the commutator of equations (8.13) does not vanish; consider rotations about perpendicular directions as a case in point]; thus,
[ ξ , η ] ζ 0 [ ξ , η ] ζ 0 [xi,eta]-=zeta!=0[\xi, \eta] \equiv \zeta \neq 0[ξ,η]ζ0
(a) Show that no single coordinate system can be simultaneously adapted, in the sense of equation (25.1), to both the ξ ξ xi\xiξ and η η eta\boldsymbol{\eta}η symmetries (see exercise 9.9).
(b) Let p ξ = p μ ξ μ , p η = p μ η μ p ξ = p μ ξ μ , p η = p μ η μ p_(xi)=p_(mu)xi^(mu),p_(eta)=p_(mu)eta^(mu)p_{\xi}=p_{\mu} \xi^{\mu}, p_{\eta}=p_{\mu} \eta^{\mu}pξ=pμξμ,pη=pμημ, and p ξ = p μ ζ μ p ξ = p μ ζ μ p_(xi)=p_(mu)zeta^(mu)p_{\xi}=p_{\mu} \zeta^{\mu}pξ=pμζμ, and derive the Poisson-bracket relationship [ p ξ , p η ] = p ξ p ξ , p η = p ξ [p_(xi),p_(eta)]=-p_(xi)\left[p_{\xi}, p_{\eta}\right]=-p_{\xi}[pξ,pη]=pξ. In a geometry, the symmetries of which are related in this way, show that p ξ p ξ p_(xi)p_{\xi}pξ is also a constant of motion.
(c) In a coordinate system where ζ = ( / x K ) ζ = / x K zeta=(del//delx^(K))\zeta=\left(\partial / \partial x^{K}\right)ζ=(/xK), define K K K\mathscr{K}K as in (25.10) and show from [ K , p ξ ] = 0 K , p ξ = 0 [K,p_(xi)]=0\left[\mathcal{K}, p_{\xi}\right]=0[K,pξ]=0 that ζ ζ zeta\zetaζ is a Killing vector.
Thus the commutator of two Killing vectors is itself a Killing vector.

Exercise 25.6. EIGENVALUE PROBLEM FOR KILLING VECTORS

Show that any Killing vector satisfies ξ μ ; μ = 0 ξ μ ; μ = 0 xi^(mu)_(;mu)=0\xi^{\mu}{ }_{; \mu}=0ξμ;μ=0, and is an eigenvector with eigenvalue κ = 0 κ = 0 kappa=0\kappa=0κ=0 of the equation
(25.11) ξ μ ; ν ; p + R μ ν ξ ν = κ ξ μ . (25.11) ξ μ ; ν ; p + R μ ν ξ ν = κ ξ μ . {:(25.11)xi^(mu;nu)_(;p)+R^(mu)_(nu)xi^(nu)=kappaxi^(mu).:}\begin{equation*} \xi^{\mu ; \nu}{ }_{; p}+R^{\mu}{ }_{\nu} \xi^{\nu}=\kappa \xi^{\mu} . \tag{25.11} \end{equation*}(25.11)ξμ;ν;p+Rμνξν=κξμ.
Find a variational principle (Raleigh-Ritz type) for this eigenvalue equation.

§25.3. CONSERVED QUANTITIES FOR MOTION IN SCHWARZSCHILD GEOMETRY

Consider a test particle moving in the Schwarzschild geometry, described by the line element
(25.12) d s 2 = ( 1 2 M / r ) d t 2 + d r 2 ( 1 2 M / r ) + r 2 ( d θ 2 + sin 2 θ d ϕ 2 ) (25.12) d s 2 = ( 1 2 M / r ) d t 2 + d r 2 ( 1 2 M / r ) + r 2 d θ 2 + sin 2 θ d ϕ 2 {:(25.12)ds^(2)=-(1-2M//r)dt^(2)+(dr^(2))/((1-2M//r))+r^(2)(dtheta^(2)+sin^(2)theta dphi^(2)):}\begin{equation*} d s^{2}=-(1-2 M / r) d t^{2}+\frac{d r^{2}}{(1-2 M / r)}+r^{2}\left(d \theta^{2}+\sin ^{2} \theta d \phi^{2}\right) \tag{25.12} \end{equation*}(25.12)ds2=(12M/r)dt2+dr2(12M/r)+r2(dθ2+sin2θdϕ2)
This expression for the geometry applies outside any spherically symmetric center of attraction of total mass-energy M M MMM. It makes no difference, for the motion of the particle outside, what the geometry is inside, because the particle never gets there; before it can, it collides with the surface of the star-if the center of attraction is a star, that is to say, a fluid mass in hydrostatic equilibrium. At each point throughout such an equilibrium configuration, the Schwarzschild coordinate r r rrr exceeds the local value of the quantity 2 m ( r ) 2 m ( r ) 2m(r)2 m(r)2m(r); see § 23.8 § 23.8 §23.8\S 23.8§23.8. Therefore the Schwarzschild coordinate R R RRR of the surface exceeds 2 M 2 M 2M2 M2M. Consequently, expression (25.12) applies outside any equilibrium configuration, no matter how compact ( r > R > 2 M ( r > R > 2 M (r > R > 2M(r>R>2 M(r>R>2M implies that one need not face the issue of the "singularity" at r = 2 M r = 2 M r=2Mr=2 Mr=2M ). The more compact the configuration, however, the more of the Schwarzschild geometry the test particle can explore. The ideal limit is not a star in hydrostatic equilibrium. It is a star that has undergone complete gravitational collapse to a black hole. Then (25.12) applies arbitrarily close to r = 2 M r = 2 M r=2Mr=2 \mathrm{M}r=2M. This idealization is assumed here ("black hole"), because the analysis can then cover the maximum range of possible situations.
Wherever the test particle lies, and however fast it moves, project that point and project that 3 -velocity radially onto a sphere of some fixed r r rrr value, say, the unit sphere r = 1 r = 1 r=1r=1r=1. The point and the vector together define a point and a vector on the surface of the unit sphere; and they in turn mark the beginning and define the totality of a great circle. As the particle continues on its way, the radial projection of its position will continue to lie on that great circle. To depart from the great circle on one side or the other would be to give preference to the one hemisphere or the other of the unit sphere, contrary to the symmetry of the situation.
Orient the coordinate system so that the radial projection of the orbit coincides with the equator, θ = π / 2 θ = π / 2 theta=pi//2\theta=\pi / 2θ=π/2, of the polar coordinates (see Box 25.4 for the spherical trigonometry of a more general orientation of the orbit, and for eventual specializa-
Why attention focuses on particle orbits around a black hole
Choice of coordinates to make particle orbit lie in "equator," θ = π / 2 θ = π / 2 theta=pi//2\theta=\pi / 2θ=π/2
tion to a polar orbit, in contrast to the equatorial orbit considered here). In polar coordinates as so oriented, the particle has at the start, and continues to have, zero momentum in the θ θ theta\thetaθ direction; thus,
p θ = d θ / d λ = 0 . p θ = d θ / d λ = 0 . p^(theta)=d theta//d lambda=0.p^{\theta}=d \theta / d \lambda=0 .pθ=dθ/dλ=0.
Conserved quantities for particle motion:
(1) E E EEE
(2) L L LLL
(3) μ μ mu\muμ
(4) E ~ E / μ E ~ E / μ widetilde(E)-=E//mu\widetilde{E} \equiv E / \muE~E/μ
(5) L ~ L / μ L ~ L / μ tilde(L)-=L//mu\tilde{L} \equiv L / \muL~L/μ
Effective potential V ~ V ~ widetilde(V)\widetilde{V}V~, and equations for orbit when μ 0 μ 0 mu!=0\mu \neq 0μ0
The expression (25.12) for the line element shows that the geometry is unaffected by the translations t t + Δ t , ϕ ϕ + Δ ϕ t t + Δ t , ϕ ϕ + Δ ϕ t longrightarrow t+Delta t,phi longrightarrow phi+Delta phit \longrightarrow t+\Delta t, \phi \longrightarrow \phi+\Delta \phitt+Δt,ϕϕ+Δϕ. Thus the coordinates t t ttt and ϕ ϕ phi\phiϕ are "cyclic." The conjugate momenta p 0 E p 0 E p_(0)-=-Ep_{0} \equiv-Ep0E and p ϕ ± L ( L 0 ) p ϕ ± L ( L 0 ) p_(phi)-=+-L(L >= 0)p_{\phi} \equiv \pm L(L \geq 0)pϕ±L(L0) are therefore conserved. This circumstance allows one immediately to deduce the major features of the motion, as follows.
The magnitude of the 4 -vector of energy-momentum is given by the rest mass of the particle,
(25.13) g α β p α p β + μ 2 = g α β p α p β + μ 2 = 0 (25.13) g α β p α p β + μ 2 = g α β p α p β + μ 2 = 0 {:(25.13)g_(alpha beta)p^(alpha)p^(beta)+mu^(2)=g^(alpha beta)p_(alpha)p_(beta)+mu^(2)=0:}\begin{equation*} g_{\alpha \beta} p^{\alpha} p^{\beta}+\mu^{2}=g^{\alpha \beta} p_{\alpha} p_{\beta}+\mu^{2}=0 \tag{25.13} \end{equation*}(25.13)gαβpαpβ+μ2=gαβpαpβ+μ2=0
or
(25.14) E 2 ( 1 2 M / r ) + 1 ( 1 2 M / r ) ( d r d λ ) 2 + L 2 r 2 + μ 2 = 0 . (25.14) E 2 ( 1 2 M / r ) + 1 ( 1 2 M / r ) d r d λ 2 + L 2 r 2 + μ 2 = 0 . {:(25.14)-(E^(2))/((1-2M//r))+(1)/((1-2M//r))((dr)/(d lambda))^(2)+(L^(2))/(r^(2))+mu^(2)=0.:}\begin{equation*} -\frac{E^{2}}{(1-2 M / r)}+\frac{1}{(1-2 M / r)}\left(\frac{d r}{d \lambda}\right)^{2}+\frac{L^{2}}{r^{2}}+\mu^{2}=0 . \tag{25.14} \end{equation*}(25.14)E2(12M/r)+1(12M/r)(drdλ)2+L2r2+μ2=0.
Moreover, one knows from the equivalence principle that test particles follow the same world line regardless of mass. Therefore what is relevant for the motion of particles is not the energy and angular momentum themselves, but the ratios
E ~ = E / μ = ( energy per unit rest mass ) , (25.15) L ~ = L / μ = ( angular momentum per unit rest mass ) . E ~ = E / μ = (  energy per unit   rest mass  ) , (25.15) L ~ = L / μ = (  angular momentum   per unit rest mass  ) . {:[ widetilde(E)=E//mu=((" energy per unit ")/(" rest mass "))","],[(25.15) widetilde(L)=L//mu=((" angular momentum ")/(" per unit rest mass ")).]:}\begin{gather*} \widetilde{E}=E / \mu=\binom{\text { energy per unit }}{\text { rest mass }}, \\ \widetilde{L}=L / \mu=\binom{\text { angular momentum }}{\text { per unit rest mass }} . \tag{25.15} \end{gather*}E~=E/μ=( energy per unit  rest mass ),(25.15)L~=L/μ=( angular momentum  per unit rest mass ).
Recall also
λ = τ / μ = ( proper time per unit rest mass ) . λ = τ / μ = (  proper time per   unit rest mass  ) . lambda=tau//mu=((" proper time per ")/(" unit rest mass ")).\lambda=\tau / \mu=\binom{\text { proper time per }}{\text { unit rest mass }} .λ=τ/μ=( proper time per  unit rest mass ).
Then (25.14) becomes an equation for the change of r r rrr-coordinate with proper time in which the rest mass makes no appearance:
( d r d τ ) 2 = E ~ 2 ( 1 2 M / r ) ( 1 + L ~ 2 / r 2 ) (25.16a) = E ~ 2 V ~ 2 ( r ) d r d τ 2 = E ~ 2 ( 1 2 M / r ) 1 + L ~ 2 / r 2 (25.16a) = E ~ 2 V ~ 2 ( r ) {:[((dr)/(d tau))^(2)= widetilde(E)^(2)-(1-2M//r)(1+ widetilde(L)^(2)//r^(2))],[(25.16a)= widetilde(E)^(2)- widetilde(V)^(2)(r)]:}\begin{align*} \left(\frac{d r}{d \tau}\right)^{2} & =\widetilde{E}^{2}-(1-2 M / r)\left(1+\widetilde{L}^{2} / r^{2}\right) \\ & =\widetilde{E}^{2}-\widetilde{V}^{2}(r) \tag{25.16a} \end{align*}(drdτ)2=E~2(12M/r)(1+L~2/r2)(25.16a)=E~2V~2(r)
Here
(25.16b) V ~ ( r ) = [ ( 1 2 M / r ) ( 1 + L ~ 2 / r 2 ) ] 1 / 2 (25.16b) V ~ ( r ) = ( 1 2 M / r ) 1 + L ~ 2 / r 2 1 / 2 {:(25.16b) widetilde(V)(r)=[(1-2M//r)(1+ widetilde(L)^(2)//r^(2))]^(1//2):}\begin{equation*} \widetilde{V}(r)=\left[(1-2 M / r)\left(1+\widetilde{L}^{2} / r^{2}\right)\right]^{1 / 2} \tag{25.16b} \end{equation*}(25.16b)V~(r)=[(12M/r)(1+L~2/r2)]1/2
is the "effective potential" mentioned in § 25.1 § 25.1 §25.1\S 25.1§25.1 and Figure 25.2 and to be discussed
in $ 25.5 $ 25.5 $25.5\$ 25.5$25.5. For the rate of change of the other two relevant coordinates with proper time, one has, assuming a "direct" orbit ( d ϕ / d τ > 0 ; p ϕ = + L d ϕ / d τ > 0 ; p ϕ = + L (d phi//d tau > 0;p_(phi)=+L:}\left(d \phi / d \tau>0 ; p_{\phi}=+L\right.(dϕ/dτ>0;pϕ=+L rather than L ) L {:-L)\left.-L\right)L),
(25.17) d ϕ d τ = 1 μ d ϕ d λ = p ϕ μ = g ϕ ϕ L μ = L ~ r 2 (25.17) d ϕ d τ = 1 μ d ϕ d λ = p ϕ μ = g ϕ ϕ L μ = L ~ r 2 {:(25.17)(d phi)/(d tau)=(1)/(mu)(d phi)/(d lambda)=(p^(phi))/(mu)=(g^(phi phi)L)/(mu)=(( widetilde(L)))/(r^(2)):}\begin{equation*} \frac{d \phi}{d \tau}=\frac{1}{\mu} \frac{d \phi}{d \lambda}=\frac{p^{\phi}}{\mu}=\frac{g^{\phi \phi} L}{\mu}=\frac{\widetilde{L}}{r^{2}} \tag{25.17} \end{equation*}(25.17)dϕdτ=1μdϕdλ=pϕμ=gϕϕLμ=L~r2
and
(25.18) d t d τ = 1 μ d t d λ = p 0 μ = g 00 E μ = E ~ 1 2 M / r (25.18) d t d τ = 1 μ d t d λ = p 0 μ = g 00 E μ = E ~ 1 2 M / r {:(25.18)(dt)/(d tau)=(1)/(mu)(dt)/(d lambda)=(p^(0))/(mu)=-(g^(00)E)/(mu)=(( widetilde(E)))/(1-2M//r):}\begin{equation*} \frac{d t}{d \tau}=\frac{1}{\mu} \frac{d t}{d \lambda}=\frac{p^{0}}{\mu}=-\frac{g^{00} E}{\mu}=\frac{\widetilde{E}}{1-2 M / r} \tag{25.18} \end{equation*}(25.18)dtdτ=1μdtdλ=p0μ=g00Eμ=E~12M/r
Knowing r r rrr as a function of τ τ tau\tauτ from (25.16), one can find ϕ ϕ phi\phiϕ and t t ttt in their dependence on τ τ tau\tauτ from (25.17) and (25.18). Symmetry considerations have in effect reduced the four coupled second-order differential equations p μ ; p p ν = 0 p μ ; p p ν = 0 p^(mu)_(;p)p^(nu)=0p^{\mu}{ }_{; p} p^{\nu}=0pμ;ppν=0 of geodesic motion to the single first-order equation (25.16).
For objects of zero rest mass, it makes no sense to refer to proper time, and a slightly different treatment is appropriate ( $ 25.6 $ 25.6 $25.6\$ 25.6$25.6 ).
Before looking, in § 25.5 § 25.5 §25.5\S 25.5§25.5, at the motions predicted by equations (25.16) to (25.18), it is useful to analyze the physical significance of the constants p 0 p 0 p_(0)p_{0}p0 and p ϕ p ϕ p_(phi)p_{\phi}pϕ, and to identify other physically significant quantities whose values will be of interest in studying these orbits. One calls E = p 0 E = p 0 E=-p_(0)E=-p_{0}E=p0 the "energy at infinity"; and L = | p ϕ | L = p ϕ L=|p_(phi)|L=\left|p_{\phi}\right|L=|pϕ|, for equatorial orbits, the "total angular momentum." To justify these names, compare them with standard quantities measured by an observer at rest on the equator of the Schwarzschild coordinate system as the test particle flies past him in its orbit. Let
E local p 0 ^ ω 0 ^ , p | g 00 | 1 / 2 d t , p = | g 00 | 1 / 2 p 0 (25.19) = | g 00 | 1 / 2 d t / d λ = ( 1 2 M / r ) 1 / 2 d t / d λ E local  p 0 ^ ω 0 ^ , p | g 00 1 / 2 d t , p = g 00 1 / 2 p 0 (25.19) = g 00 1 / 2 d t / d λ = ( 1 2 M / r ) 1 / 2 d t / d λ {:[E_("local "){:-=p^( hat(0))-=(:omega^( hat(0)),p:)-=(:|g_(00)|^(1//2)dt,p:)=|g_(00)|^(1//2)p^(0)],[(25.19)=|g_(00)|^(1//2)dt//d lambda=(1-2M//r)^(1//2)dt//d lambda]:}\begin{align*} E_{\text {local }} & \left.\left.\equiv p^{\hat{0}} \equiv\left\langle\boldsymbol{\omega}^{\hat{0}}, \boldsymbol{p}\right\rangle \equiv\langle | g_{00}\right|^{1 / 2} \boldsymbol{d} t, \boldsymbol{p}\right\rangle=\left|g_{00}\right|^{1 / 2} p^{0} \\ & =\left|g_{00}\right|^{1 / 2} d t / d \lambda=(1-2 M / r)^{1 / 2} d t / d \lambda \tag{25.19} \end{align*}Elocal p0^ω0^,p|g00|1/2dt,p=|g00|1/2p0(25.19)=|g00|1/2dt/dλ=(12M/r)1/2dt/dλ
be the energy he measures in his proper reference frame, and let
(25.20) v ϕ ^ p ϕ ^ p 0 ^ ω ^ , p E local = | g ϕ ϕ | 1 / 2 d ϕ , d / d λ E local = r ( d ϕ / d λ ) E local = p ϕ r E local (25.20) v ϕ ^ p ϕ ^ p 0 ^ ω ^ , p E local  = | g ϕ ϕ 1 / 2 d ϕ , d / d λ E local  = r ( d ϕ / d λ ) E local  = p ϕ r E local  {:[(25.20)v_( hat(phi))-=(p^( hat(phi)))/(p^( hat(0)))-=((:omega^( hat()),p:))/(E_("local "))=((:|g_(phi phi)|^(1//2)d phi,d//d lambda:))/(E_("local "))],[=(r(d phi//d lambda))/(E_("local "))=(p_(phi))/(rE_("local "))]:}\begin{align*} v_{\hat{\phi}} & \equiv \frac{p^{\hat{\phi}}}{p^{\hat{0}}} \equiv \frac{\left\langle\boldsymbol{\omega}^{\hat{}}, \boldsymbol{p}\right\rangle}{E_{\text {local }}}=\frac{\left.\left.\langle | g_{\phi \phi}\right|^{1 / 2} \boldsymbol{d} \phi, d / d \lambda\right\rangle}{E_{\text {local }}} \tag{25.20}\\ & =\frac{r(d \phi / d \lambda)}{E_{\text {local }}}=\frac{p_{\phi}}{r E_{\text {local }}} \end{align*}(25.20)vϕ^pϕ^p0^ω^,pElocal =|gϕϕ|1/2dϕ,d/dλElocal =r(dϕ/dλ)Elocal =pϕrElocal 
be the tangential component of the ordinary velocity he measures. [Note: ω α ^ ω α ^ omega^( hat(alpha))\boldsymbol{\omega}^{\hat{\alpha}}ωα^ are the basis one-forms of the observer's proper reference frame; see equations ( 23.15 a , b 23.15 a , b 23.15a,b23.15 \mathrm{a}, \mathrm{b}23.15a,b ).] In terms of these locally measured quantities, the energy-at-infinity is
(25.21) E = p 0 = g 00 p 0 = | g 00 | 1 / 2 E local = ( 1 2 M / r ) 1 / 2 E local = constant. (25.21) E = p 0 = g 00 p 0 = g 00 1 / 2 E local  = ( 1 2 M / r ) 1 / 2 E local  =  constant.  {:(25.21)E=-p_(0)=-g_(00)p^(0)=|g_(00)|^(1//2)E_("local ")=(1-2M//r)^(1//2)E_("local ")=" constant. ":}\begin{equation*} E=-p_{0}=-g_{00} p^{0}=\left|g_{00}\right|^{1 / 2} E_{\text {local }}=(1-2 M / r)^{1 / 2} E_{\text {local }}=\text { constant. } \tag{25.21} \end{equation*}(25.21)E=p0=g00p0=|g00|1/2Elocal =(12M/r)1/2Elocal = constant. 
It therefore represents the locally measured energy E local E local  E_("local ")E_{\text {local }}Elocal , corrected by a factor | g 00 | 1 / 2 g 00 1 / 2 |g_(00)|^(1//2)\left|g_{00}\right|^{1 / 2}|g00|1/2. For any particle that flies freely (geodesic motion) from this observer to r = r = r=oor=\inftyr=, the correction factor reduces to unity, and E local E local  E_("local ")E_{\text {local }}Elocal  (as measured by a second observer, this time at infinity) becomes identical with E E EEE. Similarly the angular momentum from (25.20) is
(25.22) p ϕ = E local O ^ 2 ϕ r = constant. (25.22) p ϕ = E local  O ^ 2 ϕ r =  constant.  {:(25.22)p_(phi)=E_("local ") hat(O)_(2^(phi))r=" constant. ":}\begin{equation*} p_{\phi}=E_{\text {local }} \hat{O}_{\stackrel{\phi}{2}} r=\text { constant. } \tag{25.22} \end{equation*}(25.22)pϕ=Elocal O^2ϕr= constant. 

Interpretation of E E EEE as

"energy at infinity" and L L LLL as "angular momentum"
This, like E = p 0 E = p 0 E=-p_(0)E=-p_{0}E=p0, represents a quantity that is conserved, and whose interpretation for r r r longrightarrow oor \longrightarrow \inftyr on any orbit is familiar. Finally, recall that the total 4 -momentum of two colliding particles p 1 + p 2 p 1 + p 2 p_(1)+p_(2)\boldsymbol{p}_{1}+\boldsymbol{p}_{2}p1+p2 or ( p μ ) 1 + ( p μ ) 2 p μ 1 + p μ 2 (p_(mu))_(1)+(p_(mu))_(2)\left(p_{\mu}\right)_{1}+\left(p_{\mu}\right)_{2}(pμ)1+(pμ)2 is conserved in a point collision (at any r r rrr ). Therefore the totals ( E ) 1 + ( E ) 2 = ( p 0 ) 1 + ( p 0 ) 2 ( E ) 1 + ( E ) 2 = p 0 1 + p 0 2 (E)_(1)+(E)_(2)=(-p_(0))_(1)+(-p_(0))_(2)(E)_{1}+(E)_{2}=\left(-p_{0}\right)_{1}+\left(-p_{0}\right)_{2}(E)1+(E)2=(p0)1+(p0)2 and ( p ϕ ) 1 + ( p ϕ ) 2 p ϕ 1 + p ϕ 2 (p_(phi))_(1)+(p_(phi))_(2)\left(p_{\phi}\right)_{1}+\left(p_{\phi}\right)_{2}(pϕ)1+(pϕ)2 are also conserved. One of the colliding particles may be on an orbit that could never reach out to r = r = r=oor=\inftyr=, but this makes no difference. This conservation principle allows and forces one to take over the terms E = E = E=E=E= "energy at infinity" and L = L = L=L=L= "angular momentum," valid for orbits that do reach to infinity, for an orbit that does not reach to infinity.

EXERCISES

Exercise 25.7. RADIAL VELOCITY OF A TEST PARTICLE

Obtain a formula for the radial component of velocity v r ^ v r ^ v_( hat(r))v_{\hat{r}}vr^ that an observer at r r rrr would measure [see (25.20) for v ϕ ^ v ϕ ^ v_( hat(phi))v_{\hat{\phi}}vϕ^ ]. Express E local , v r ^ E local  , v r ^ E_("local "),v_( hat(r))E_{\text {local }}, v_{\hat{r}}Elocal ,vr^, and v ϕ ^ v ϕ ^ v_( hat(phi))v_{\hat{\phi}}vϕ^ in terms of r r rrr and the constants E , p ϕ E , p ϕ E,p_(phi)E, p_{\phi}E,pϕ.

Exercise 25.8. ROTATIONAL KILLING VECTORS FOR SCHWARZSCHILD GEOMETRY

(a) Show that in the isotropic coordinates of exercise 23.1, the metric for the Schwarzschild geometry takes the form
(25.23) d s 2 = ( 1 M / 2 r ¯ ) 2 ( 1 + M / 2 r ¯ ) 2 d t 2 + ( 1 + M / 2 r ¯ ) 4 ( d r ¯ 2 + r ¯ 2 d Ω 2 ) (25.23) d s 2 = ( 1 M / 2 r ¯ ) 2 ( 1 + M / 2 r ¯ ) 2 d t 2 + ( 1 + M / 2 r ¯ ) 4 d r ¯ 2 + r ¯ 2 d Ω 2 {:(25.23)ds^(2)=-(1-M//2 bar(r))^(2)(1+M//2 bar(r))^(-2)dt^(2)+(1+M//2 bar(r))^(4)(d bar(r)^(2)+ bar(r)^(2)dOmega^(2)):}\begin{equation*} d s^{2}=-(1-M / 2 \bar{r})^{2}(1+M / 2 \bar{r})^{-2} d t^{2}+(1+M / 2 \bar{r})^{4}\left(d \bar{r}^{2}+\bar{r}^{2} d \Omega^{2}\right) \tag{25.23} \end{equation*}(25.23)ds2=(1M/2r¯)2(1+M/2r¯)2dt2+(1+M/2r¯)4(dr¯2+r¯2dΩ2)
(b) Exhibit a coordinate transformation that brings this into the form
(25.24) d s 2 = ( 1 M / 2 r ¯ ) 2 ( 1 + M / 2 r ¯ ) 2 d t 2 + ( 1 + M / 2 r ¯ ) 4 ( d x 2 + d y 2 + d z 2 ) (25.24) d s 2 = ( 1 M / 2 r ¯ ) 2 ( 1 + M / 2 r ¯ ) 2 d t 2 + ( 1 + M / 2 r ¯ ) 4 d x 2 + d y 2 + d z 2 {:(25.24)ds^(2)=-(1-M//2 bar(r))^(2)(1+M//2 bar(r))^(-2)dt^(2)+(1+M//2 bar(r))^(4)(dx^(2)+dy^(2)+dz^(2)):}\begin{equation*} d s^{2}=-(1-M / 2 \bar{r})^{2}(1+M / 2 \bar{r})^{-2} d t^{2}+(1+M / 2 \bar{r})^{4}\left(d x^{2}+d y^{2}+d z^{2}\right) \tag{25.24} \end{equation*}(25.24)ds2=(1M/2r¯)2(1+M/2r¯)2dt2+(1+M/2r¯)4(dx2+dy2+dz2)
with r ¯ = ( x 2 + y 2 + z 2 ) 1 / 2 r ¯ = x 2 + y 2 + z 2 1 / 2 bar(r)=(x^(2)+y^(2)+z^(2))^(1//2)\bar{r}=\left(x^{2}+y^{2}+z^{2}\right)^{1 / 2}r¯=(x2+y2+z2)1/2.
(c) Show that ξ x = y ( / z ) z ( / y ) ξ x = y ( / z ) z ( / y ) xi_(x)=y(del//del z)-z(del//del y)\xi_{x}=y(\partial / \partial z)-z(\partial / \partial y)ξx=y(/z)z(/y) and similar vectors ξ y ξ y xi_(y)\xi_{y}ξy and ξ z ξ z xi_(z)\xi_{z}ξz are each Killing vectors, by verifying (see exercise 25.5 c ) that the Poisson brackets [ K , L K ] K , L K [K,L_(K)]\left[\mathcal{K}, L_{K}\right][K,LK] vanish for each L K = p ξ K , K = x , y , z L K = p ξ K , K = x , y , z L_(K)=p*xi_(K),K=x,y,zL_{K}=\boldsymbol{p} \cdot \xi_{K}, K=x, y, zLK=pξK,K=x,y,z.
(d) Show that ξ z = ( / ϕ ) t , r , θ ξ z = ( / ϕ ) t , r , θ xi_(z)=(del//del phi)_(t,r,theta)\xi_{z}=(\partial / \partial \phi)_{t, r, \theta}ξz=(/ϕ)t,r,θ; and show that for orbits in the equatorial plane L z = p ϕ L z = p ϕ L_(z)=p_(phi)L_{z}=p_{\phi}Lz=pϕ, L x = L y = 0 L x = L y = 0 L_(x)=L_(y)=0L_{x}=L_{y}=0Lx=Ly=0.

Exercise 25.9. CONSERVATION OF TOTAL ANGULAR MOMENTUM OF A TEST PARTICLE

Prove by a Poisson-bracket calculation that the total angular momentum squared, L 2 = L 2 = L^(2)=L^{2}=L2= p θ 2 + ( sin θ ) 2 p ϕ 2 p θ 2 + ( sin θ ) 2 p ϕ 2 p_(theta)^(2)+(sin theta)^(-2)p_(phi)^(2)p_{\theta}{ }^{2}+(\sin \theta)^{-2} p_{\phi}{ }^{2}pθ2+(sinθ)2pϕ2 is a constant of motion for any Schwarzschild geodesic.

Exercise 25.10. SELECTING EQUATION BY SELECTING WHAT IS VARIED

Write out the integral I I III that is varied in (25.8) for the special case of the Schwarzschild metric (25.12). What equation results from the demand δ I = 0 δ I = 0 delta I=0\delta I=0δI=0 if only ϕ ( λ ) ϕ ( λ ) phi(lambda)\phi(\lambda)ϕ(λ) is varied? If only t ( λ ) t ( λ ) t(lambda)t(\lambda)t(λ) ?

Exercise 25.11. MOTION DERIVED FROM SUPER-HAMILTONIAN

Write out the super-Hamiltonian (25.10) for the special case of the Schwarzschild metric. Deduce from its form that p 0 p 0 p_(0)p_{0}p0 and p ϕ p ϕ p_(phi)p_{\phi}pϕ are constants of motion. Derive (25.14), (25.17), and (25.18) from this super-Hamiltonian formalism.

§25.4. GRAVITATIONAL REDSHIFT

The conservation law | g 00 | 1 / 2 E local = g 00 1 / 2 E local  = |g_(00)|^(1//2)E_("local ")=\left|g_{00}\right|^{1 / 2} E_{\text {local }}=|g00|1/2Elocal = constant (equation 25.21), which is valid in this form for any time-independent metric with g 0 j = 0 g 0 j = 0 g_(0j)=0g_{0 j}=0g0j=0 and for particles with both zero and non-zero rest mass, is sometimes called the "law of energy red-shift." It describes how the locally measured energy of any particle or photon changes (is "red-shifted" or "blue-shifted") as it climbs out of or falls into a static gravitational field. For a particle of zero rest mass (photon or neutrino), the locally measured energy E local E local  E_("local ")E_{\text {local }}Elocal , and wavelength λ local λ local  lambda_("local ")\lambda_{\text {local }}λlocal  (not to be confused with affine parameter!), are related by E local = h / λ local E local  = h / λ local  E_("local ")=h//lambda_("local ")E_{\text {local }}=h / \lambda_{\text {local }}Elocal =h/λlocal , where h h hhh is Planck's constant. Consequently, the law of energy red-shift can be rewritten as
(25.25) λ local | g o o | 1 / 2 = constant. (25.25) λ local  g o o 1 / 2 =  constant.  {:(25.25)lambda_("local ")|g_(oo)|^(-1//2)=" constant. ":}\begin{equation*} \lambda_{\text {local }}\left|g_{o o}\right|^{-1 / 2}=\text { constant. } \tag{25.25} \end{equation*}(25.25)λlocal |goo|1/2= constant. 
A photon emitted by an atom at rest in the gravitational field at radius r r rrr, and received by an astronomer at rest at infinity is red-shifted by the amount
z Δ λ / λ = ( λ received λ emitted ) / λ emitted = | g 00 ( r ) | 1 / 2 1 (25.26) z = ( 1 2 M / r ) 1 / 2 1 (25.26~N) z M / r in Newtonian limit. z Δ λ / λ = λ received  λ emitted  / λ emitted  = g 00 ( r ) 1 / 2 1 (25.26) z = ( 1 2 M / r ) 1 / 2 1 (25.26~N) z M / r  in Newtonian limit.  {:[z-=Delta lambda//lambda=(lambda_("received ")-lambda_("emitted "))//lambda_("emitted ")=|g_(00)(r)|^(-1//2)-1],[(25.26)z=(1-2M//r)^(-1//2)-1],[(25.26~N)z~~M//r" in Newtonian limit. "]:}\begin{align*} & z \equiv \Delta \lambda / \lambda=\left(\lambda_{\text {received }}-\lambda_{\text {emitted }}\right) / \lambda_{\text {emitted }}=\left|g_{00}(r)\right|^{-1 / 2}-1 \\ & z=(1-2 M / r)^{-1 / 2}-1 \tag{25.26}\\ & z \approx M / r \text { in Newtonian limit. } \tag{25.26~N} \end{align*}zΔλ/λ=(λreceived λemitted )/λemitted =|g00(r)|1/21(25.26)z=(12M/r)1/21(25.26~N)zM/r in Newtonian limit. 
Note that these expressions are valid whether the photon travels along a radial path or not.

Exercise 25.12. REDSHIFT BY TIMED PULSES

EXERCISE

Derive expression (25.26) for the photon redshift by considering two pulses of light emitted successively by an atom at rest at radius r r rrr. [Hint: If Δ τ em Δ τ em  Deltatau_("em ")\Delta \tau_{\text {em }}Δτem  is the proper time between pulses as measured by the emitting atom, and Δ τ rec Δ τ rec  Deltatau_("rec ")\Delta \tau_{\text {rec }}Δτrec  is the proper time separation as measured by the observer at r = r = r=oor=\inftyr=, then one can idealize λ em λ em  lambda_("em ")\lambda_{\text {em }}λem  as Δ τ em Δ τ em  Deltatau_("em ")\Delta \tau_{\text {em }}Δτem  and λ rec λ rec  lambda_("rec ")\lambda_{\text {rec }}λrec  as Δ τ rec. Δ τ rec.  Deltatau_("rec. ")\Delta \tau_{\text {rec. }}Δτrec. .]
Law of "energy redshift" ('gravitational redshift")
Qualitative features of orbits diagnosed from effective-potential diagram

and illustrated in Figure 25.2 and Box 25.6. The first diagram in Box 25.6 gives V ~ 2 ( r ) V ~ 2 ( r ) widetilde(V)^(2)(r)\widetilde{V}^{2}(r)V~2(r) as a function of r r rrr. It is relevant even in the "domain inside the black hole" ( r < 2 M ) ( r < 2 M ) (r < 2M)(r<2 M)(r<2M), where V ~ 2 V ~ 2 widetilde(V)^(2)\widetilde{V}^{2}V~2 is negative (see Chapter 31). It serves as a model for, and is closely related to, the "effective potential" B 2 ( r ) B 2 ( r ) B^(-2)(r)B^{-2}(r)B2(r) used in § 25.6 § 25.6 §25.6\S 25.6§25.6 to analyze photon orbits. The final diagram in Box 25.6 gives V ~ ( r ) V ~ ( r ) widetilde(V)(r)\widetilde{V}(r)V~(r) itself as a function of r r rrr. Energy levels in this diagram or in Figure 25.2 can be interpreted as in any conventional energy-level diagram. The difference in energy between two levels represents energy, as measured at infinity, of the photon given off in the transition from the one level to the other. Whether one plots V ~ ( r ) V ~ ( r ) widetilde(V)(r)\widetilde{V}(r)V~(r) or V ~ 2 ( r ) V ~ 2 ( r ) widetilde(V)^(2)(r)\widetilde{V}^{2}(r)V~2(r) as a function of r r rrr is largely a matter of convenience. The important point is this: a value of r r rrr where V ~ ( r ) V ~ ( r ) widetilde(V)(r)\widetilde{V}(r)V~(r) becomes equal to the available energy E ~ E ~ widetilde(E)\widetilde{E}E~, or V ~ 2 ( r ) V ~ 2 ( r ) widetilde(V)^(2)(r)\widetilde{V}^{2}(r)V~2(r) becomes equal to E ~ 2 E ~ 2 widetilde(E)^(2)\widetilde{E}^{2}E~2, is a turning point. A particle that was moving to larger r r rrr values, once arrived at a turning point, turns around and moves to smaller r r rrr values. Or when a particle moving to smaller r r rrr values comes to a turning point, it reverses its motion and proceeds to larger r r rrr values. A turning point is not a point of equilibrium. A stone thrown straight up does not sit at a point of equilibrium at the top of its flight. However, when E ~ V ~ ( r ) E ~ V ~ ( r ) widetilde(E)- widetilde(V)(r)\widetilde{E}-\widetilde{V}(r)E~V~(r), or E ~ 2 V ~ 2 ( r ) E ~ 2 V ~ 2 ( r ) widetilde(E)^(2)- widetilde(V)^(2)(r)\widetilde{E}^{2}-\widetilde{V}^{2}(r)E~2V~2(r), instead of having a single root, has a double root, then one does deal with a point of equilibrium (only possible because of "centrifugal force" fighting against gravity). When this equilibrium occurs at a minimum of V ~ ( r ) V ~ ( r ) widetilde(V)(r)\widetilde{V}(r)V~(r), it is a stable equilibrium; at a maximum, an unstable equilibrium. Thus all the major features of the motion in the r r rrr direction can be read from a plot of the effective potential as a function of r r rrr (plot depends on value of L ~ L ~ widetilde(L)\widetilde{L}L~ ) and from a knowledge of the E ~ E ~ widetilde(E)\widetilde{E}E~ value (Figure 25.2, with further details in Box 25.6).
Box 25.6 QUALITATIVE FEATURES OF ORBITS OF A PARTICLE MOVING IN SCHWARZSCHILD GEOMETRY

A. Equations Governing Orbit (see text for derivation)

  1. Effective-potential equation for radial part of motion:
( d r / d τ ) 2 + V ~ 2 ( L ~ , r ) = E ~ 2 V ~ 2 ( L ~ , r ) = ( 1 2 M / r ) ( 1 + L ~ 2 / r 2 ) ( d r / d τ ) 2 + V ~ 2 ( L ~ , r ) = E ~ 2 V ~ 2 ( L ~ , r ) = ( 1 2 M / r ) 1 + L ~ 2 / r 2 {:[(dr//d tau)^(2)+ widetilde(V)^(2)( widetilde(L)","r)= widetilde(E)^(2)],[ widetilde(V)^(2)( widetilde(L)","r)=(1-2M//r)(1+ widetilde(L)^(2)//r^(2))]:}\begin{gathered} (d r / d \tau)^{2}+\widetilde{V}^{2}(\widetilde{L}, r)=\widetilde{E}^{2} \\ \widetilde{V}^{2}(\widetilde{L}, r)=(1-2 M / r)\left(1+\widetilde{L}^{2} / r^{2}\right) \end{gathered}(dr/dτ)2+V~2(L~,r)=E~2V~2(L~,r)=(12M/r)(1+L~2/r2)
E ~ = E ~ = widetilde(E)=\widetilde{E}=E~= (energy at infinity per unit rest mass),
L ~ = L ~ = widetilde(L)=\widetilde{L}=L~= (angular momentum per unit rest mass).
2. Supplementary equations for angular and time motion for "direct" orbit, d ϕ ˙ / d τ > 0 d ϕ ˙ / d τ > 0 dphi^(˙)//d tau > 0d \dot{\phi} / d \tau>0dϕ˙/dτ>0 :
d ϕ / d τ = L ~ / r 2 d t d τ = E ~ 1 2 M / r d ϕ / d τ = L ~ / r 2 d t d τ = E ~ 1 2 M / r {:[d phi//d tau= widetilde(L)//r^(2)],[(dt)/(d tau)=(( widetilde(E)))/(1-2M//r)]:}\begin{gathered} d \phi / d \tau=\widetilde{L} / r^{2} \\ \frac{d t}{d \tau}=\frac{\widetilde{E}}{1-2 M / r} \end{gathered}dϕ/dτ=L~/r2dtdτ=E~12M/r

"Turning points" of orbit occur where horizontal line of height E ~ 2 E ~ 2 widetilde(E)^(2)\widetilde{E}^{2}E~2 crosses V ~ 2 V ~ 2 widetilde(V)^(2)\widetilde{V}^{2}V~2
B. Newtonian Limit, | E ~ 1 | 1 | E ~ 1 | 1 | widetilde(E)-1|≪1|\widetilde{E}-1| \ll 1|E~1|1, M / r 1 , L ~ / r 1 M / r 1 , L ~ / r 1 M//r≪1, widetilde(L)//r≪1M / r \ll 1, \widetilde{L} / r \ll 1M/r1,L~/r1
  1. Speak not about "energy-at-infinity per unit rest mass," E ~ = E / μ = ( 1 v 2 ) 1 / 2 E ~ = E / μ = 1 v 2 1 / 2 widetilde(E)=E//mu=(1-v_(oo)^(2))^(-1//2)\widetilde{E}=E / \mu=\left(1-v_{\infty}^{2}\right)^{-1 / 2}E~=E/μ=(1v2)1/2, but instead about the "nonrelativistic energy per unit rest mass,"
ε 1 2 ( E ~ 2 1 ) E ~ 1 1 2 v 2 . ε 1 2 E ~ 2 1 E ~ 1 1 2 v 2 . epsi-=(1)/(2)( widetilde(E)^(2)-1)~~ widetilde(E)-1~~(1)/(2)v_(oo)^(2).\varepsilon \equiv \frac{1}{2}\left(\widetilde{E}^{2}-1\right) \approx \widetilde{E}-1 \approx \frac{1}{2} v_{\infty}^{2} .ε12(E~21)E~112v2.
  1. Speak not about V ~ 2 ( L ~ , r ) V ~ 2 ( L ~ , r ) widetilde(V)^(2)( widetilde(L),r)\widetilde{V}^{2}(\widetilde{L}, r)V~2(L~,r) but instead about the Newtonian effective potential,
V N ( L ~ , r ) 1 2 ( V ~ 2 1 ) M r + L ~ 2 2 r 2 . V N ( L ~ , r ) 1 2 V ~ 2 1 M r + L ~ 2 2 r 2 . V_(N)( widetilde(L),r)-=(1)/(2)( widetilde(V)^(2)-1)~~-(M)/(r)+( widetilde(L)^(2))/(2r^(2)).V_{N}(\widetilde{L}, r) \equiv \frac{1}{2}\left(\widetilde{V}^{2}-1\right) \approx-\frac{M}{r}+\frac{\widetilde{L}^{2}}{2 r^{2}} .VN(L~,r)12(V~21)Mr+L~22r2.
  1. Rewrite effective-potential equation in the form
1 2 ( d r d τ ) 2 + V N ( L ~ , r ) = ε 1 2 d r d τ 2 + V N ( L ~ , r ) = ε (1)/(2)((dr)/(d tau))^(2)+V_(N)( widetilde(L),r)=epsi\frac{1}{2}\left(\frac{d r}{d \tau}\right)^{2}+V_{N}(\widetilde{L}, r)=\varepsilon12(drdτ)2+VN(L~,r)=ε
  1. From the effective-potential diagram and the subsidiary equation d ϕ / d τ = L ~ / r 2 d ϕ / d τ = L ~ / r 2 d phi//d tau= widetilde(L)//r^(2)d \phi / d \tau=\widetilde{L} / r^{2}dϕ/dτ=L~/r2, conclude that:
    a. Particles with ε 0 ( E ~ 1 ) ε 0 ( E ~ 1 ) epsi >= 0( widetilde(E) >= 1)\varepsilon \geq 0(\widetilde{E} \geq 1)ε0(E~1) come in from r = r = r=oor=\inftyr= along hyperbolic or parabolic orbits, are reflected off the effective potential at ε = V N [ E ~ 2 = V ~ 2 ε = V N E ~ 2 = V ~ 2 epsi=V_(N)[ widetilde(E)^(2)= widetilde(V)^(2):}\varepsilon=V_{N}\left[\widetilde{E}^{2}=\widetilde{V}^{2}\right.ε=VN[E~2=V~2; "turning point"; ( d r / d τ ) 2 = 0 ] ( d r / d τ ) 2 = 0 {:(dr//d tau)^(2)=0]\left.(d r / d \tau)^{2}=0\right](dr/dτ)2=0], and return to r = r = r=oor=\inftyr=.
    b. Particles with ε < 0 ( E ~ < 1 ) ε < 0 ( E ~ < 1 ) epsi < 0( widetilde(E) < 1)\varepsilon<0(\widetilde{E}<1)ε<0(E~<1) move back and forth in an effective potential well between periastron (inner turning point of elliptic orbit) and apastron (outer turning point).

C. Relativistic Orbits

Use the effective-potential diagram of part A (reproduced here for various L L L L L^(L)\stackrel{L}{L}LL ), in the same way one uses the Newtonian diagram of part B B BBB, to deduce the qualitative features of the orbits. The main conclusions are these.

Box 25.6 (continued)

  1. Orbits with periastrons at r M r M r≫Mr \gg MrM are Keplerian in form, except for the periastron shift (exercise 25.16; §40.5) familiar for Mercury.
  2. Orbits with periastrons at r 10 M r 10 M r <= 10Mr \leqslant 10 \mathrm{M}r10M differ markedly from Keplerian orbits.
  3. For L ¯ / M 2 3 L ¯ / M 2 3 bar(L)//M <= 2sqrt3\bar{L} / M \leq 2 \sqrt{3}L¯/M23 there is no periastron; any incoming particle is necessarily pulled into r = 2 M r = 2 M r=2Mr=2 Mr=2M.
  4. For 2 3 < L ~ / M < 4 2 3 < L ~ / M < 4 2sqrt3 < tilde(L)//M < 42 \sqrt{3}<\tilde{L} / M<423<L~/M<4 there are bound orbits in which the particle moves in and out between periastron and apastron; but any particle coming in from r = r = r=oor=\inftyr= (unbound; E ~ 2 1 E ~ 2 1 widetilde(E)^(2) >= 1\widetilde{E}^{2} \geq 1E~21 ) necessarily gets pulled into r = 2 M r = 2 M r=2Mr=2 Mr=2M.
  5. For L = L ~ / M > 4 L = L ~ / M > 4 L^(†)= widetilde(L)//M > 4L^{\dagger}=\widetilde{L} / M>4L=L~/M>4, there are bound orbits; particles coming in from r = r = r=oor=\inftyr= with
E ~ 2 < V ~ max 2 = ( 1 2 u m ) ( 1 + L 2 u m 2 ) u m 1 + 1 12 / L 2 6 E ~ 2 < V ~ max 2 = 1 2 u m 1 + L 2 u m 2 u m 1 + 1 12 / L 2 6 {:[ tilde(E)^(2) < tilde(V)_(max)^(2)=(1-2u_(m))(1+L^(†2)u_(m)^(2))],[u_(m)-=(1+sqrt(1-12//L^(†2)))/(6)]:}\begin{gathered} \tilde{E}^{2}<\tilde{V}_{\max }^{2}=\left(1-2 u_{m}\right)\left(1+L^{\dagger 2} u_{m}^{2}\right) \\ u_{m} \equiv \frac{1+\sqrt{1-12 / L^{\dagger 2}}}{6} \end{gathered}E~2<V~max2=(12um)(1+L2um2)um1+112/L26
reach periastrons and then return to r = r = r=r=r= oo\infty; but particles from r = r = r=oor=\inftyr= with E ¯ 2 > E ¯ 2 > bar(E)^(2) >\bar{E}^{2}>E¯2> V ~ max 2 V ~ max 2 widetilde(V)_(max)^(2)\widetilde{V}_{\max }{ }^{2}V~max2 get pulled into r = 2 M r = 2 M r=2Mr=2 \mathrm{M}r=2M.
6. There are stable circular orbits at the minimum of the effective potential; the minimum moves inward from r = r = r=oor=\inftyr= for L ~ = L ~ = widetilde(L)=\widetilde{L}=L~= oo\infty to r = 6 M r = 6 M r=6Mr=6 Mr=6M for L = L ~ / M = 2 3 L = L ~ / M = 2 3 L^(†)= widetilde(L)//M=2sqrt3L^{\dagger}=\widetilde{L} / M=2 \sqrt{3}L=L~/M=23. The most tightly bound, stable circular orbit ( L ~ / M = 2 3 , r = 6 M ) ( L ~ / M = 2 3 , r = 6 M ) ( widetilde(L)//M=2sqrt3,r=6M)(\widetilde{L} / M=2 \sqrt{3}, r=6 M)(L~/M=23,r=6M) has a fractional binding energy of
μ E μ = 1 E ¯ = 1 8 / 9 = 0.0572 μ E μ = 1 E ¯ = 1 8 / 9 = 0.0572 (mu-E)/(mu)=1- bar(E)=1-sqrt(8//9)=0.0572\frac{\mu-E}{\mu}=1-\bar{E}=1-\sqrt{8 / 9}=0.0572μEμ=1E¯=18/9=0.0572
  1. There are unstable circular orbits at the maximum of the effective potential; the maximum moves outward from r = 3 M r = 3 M r=3Mr=3 Mr=3M for L ~ = L ~ = widetilde(L)=oo\widetilde{L}=\inftyL~= to r = 6 M r = 6 M r=6Mr=6 Mr=6M for L ~ / M = 2 3 L ~ / M = 2 3 widetilde(L)//M=2sqrt3\widetilde{L} / M=2 \sqrt{3}L~/M=23. A particle in such a circular orbit, if perturbed inward, will spiral into r = 2 M r = 2 M r=2Mr=2 Mr=2M. If perturbed outward, and if it has E ~ 2 > 1 E ~ 2 > 1 widetilde(E)^(2) > 1\widetilde{E}^{2}>1E~2>1, it will escape to r = r = r=oor=\inftyr=. If perturbed out-

    ward, and if it has E ¯ 2 < 1 E ¯ 2 < 1 bar(E)^(2) < 1\bar{E}^{2}<1E¯2<1, it will either reach an apastron and then enter a spiraling orbit that eventually falls into the star (e.g., if δ E ~ > 0 δ E ~ > 0 delta widetilde(E) > 0\delta \widetilde{E}>0δE~>0, with unchanged angular momentum); or it will move out and in between apastron and periastron, in a stable bound orbit (e.g., if δ E ~ < 0 δ E ~ < 0 delta tilde(E) < 0\delta \tilde{E}<0δE~<0, again with unchanged angular momentum).
When one turns from qualitative features to quantitative results, one finds it appropriate to write down explicitly the proper time Δ τ Δ τ Delta tau\Delta \tauΔτ required for the particle to augment its Schwarzschild coordinate by the amount Δ r Δ r Delta r\Delta rΔr; thus (with the convention that square roots may be negative or positive, a 2 ± a a 2 ± a sqrt(a^(2))-=+-a\sqrt{a^{2}} \equiv \pm aa2±a )
(25.27) τ = d τ = d r [ E ~ 2 ( 1 2 M / r ) ( 1 + L ~ 2 / r 2 ) ] 1 / 2 (25.27) τ = d τ = d r E ~ 2 ( 1 2 M / r ) 1 + L ~ 2 / r 2 1 / 2 {:(25.27)tau=int d tau=int(dr)/([ widetilde(E)^(2)-(1-2M//r)(1+ widetilde(L)^(2)//r^(2))]^(1//2)):}\begin{equation*} \tau=\int d \tau=\int \frac{d r}{\left[\widetilde{E}^{2}-(1-2 M / r)\left(1+\widetilde{L}^{2} / r^{2}\right)\right]^{1 / 2}} \tag{25.27} \end{equation*}(25.27)τ=dτ=dr[E~2(12M/r)(1+L~2/r2)]1/2
The integration is especially simple for a particle falling straight in, or climbing straight out, for then the angular momentum vanishes and the integral can be written in an elementary form that applies (with the change τ t τ t tau longrightarrow t\tau \longrightarrow tτt ) even in Newtonian mechanics,
(25.27') τ = d τ = d r [ 2 M / r 2 M / R ] 1 / 2 (25.27') τ = d τ = d r [ 2 M / r 2 M / R ] 1 / 2 {:(25.27')tau=int d tau=int(dr)/([2M//r-2M//R]^(1//2)):}\begin{equation*} \tau=\int d \tau=\int \frac{d r}{[2 M / r-2 M / R]^{1 / 2}} \tag{25.27'} \end{equation*}(25.27')τ=dτ=dr[2M/r2M/R]1/2
Here R 2 M / ( 1 E 2 ) R 2 M / 1 E 2 R-=2M//(1-E^(2))R \equiv 2 M /\left(1-E^{2}\right)R2M/(1E2) is the radius at which the particle has zero velocity ("apastron"). The motion follows the same "cycloid principle" that is so useful in nonrelativistic mechanics (Figure 25.3). Thus, in parametric form, one has
r = R 2 ( 1 + cos η ) , (25.28) τ = R 2 ( R 2 M ) 1 / 2 ( η + sin η ) , r = R 2 ( 1 + cos η ) , (25.28) τ = R 2 R 2 M 1 / 2 ( η + sin η ) , {:[r=(R)/(2)(1+cos eta)","],[(25.28)tau=(R)/(2)((R)/(2M))^(1//2)(eta+sin eta)","]:}\begin{gather*} r=\frac{R}{2}(1+\cos \eta), \\ \tau=\frac{R}{2}\left(\frac{R}{2 M}\right)^{1 / 2}(\eta+\sin \eta), \tag{25.28} \end{gather*}r=R2(1+cosη),(25.28)τ=R2(R2M)1/2(η+sinη),
with the total proper time to fall from rest at r = R r = R r=Rr=Rr=R into r = 0 r = 0 r=0r=0r=0 given by the expression
(25.29) τ = π 2 R ( R 2 M ) 1 / 2 (25.29) τ = π 2 R R 2 M 1 / 2 {:(25.29)tau=(pi)/(2)R((R)/(2M))^(1//2):}\begin{equation*} \tau=\frac{\pi}{2} R\left(\frac{R}{2 M}\right)^{1 / 2} \tag{25.29} \end{equation*}(25.29)τ=π2R(R2M)1/2
(shorter by a factor 1 / 2 1 / 2 1//sqrt21 / \sqrt{2}1/2 than the time for fall under pull of the same mass, distributed over a sphere of radius R R RRR; see dotted curve in Figure 25.3).
What about the Schwarzschild-coordinate time taken for a given motion? Take equation (25.16a) for general motion (radial or nonradial), and where d r / d τ d r / d τ dr//d taud r / d \taudr/dτ appears, replace it by
(25.30) d r d τ = d r d t d t d τ = d r d t E ~ 1 2 M / r = E ~ d r d t . (25.30) d r d τ = d r d t d t d τ = d r d t E ~ 1 2 M / r = E ~ d r d t . {:(25.30)(dr)/(d tau)=(dr)/(dt)(dt)/(d tau)=(dr)/(dt)(( widetilde(E)))/(1-2M//r)= widetilde(E)(dr^(**))/(dt).:}\begin{equation*} \frac{d r}{d \tau}=\frac{d r}{d t} \frac{d t}{d \tau}=\frac{d r}{d t} \frac{\widetilde{E}}{1-2 M / r}=\widetilde{E} \frac{d r^{*}}{d t} . \tag{25.30} \end{equation*}(25.30)drdτ=drdtdtdτ=drdtE~12M/r=E~drdt.
Here r r r^(**)r^{*}r is an abbreviation for a new "tortoise coordinate,"
(25.31) r = d r = d r 1 2 M / r = r + 2 M ln ( r 2 M 1 ) (25.31) r = d r = d r 1 2 M / r = r + 2 M ln r 2 M 1 {:(25.31)r^(**)=int dr^(**)=int(dr)/(1-2M//r)=r+2M ln((r)/(2M)-1):}\begin{equation*} r^{*}=\int d r^{*}=\int \frac{d r}{1-2 M / r}=r+2 M \ln \left(\frac{r}{2 M}-1\right) \tag{25.31} \end{equation*}(25.31)r=dr=dr12M/r=r+2Mln(r2M1)
which was introduced by Wheeler (1955) and popularized by Regge and Wheeler (1957). Thus find the equation
(25.32) ( E ~ d r d t ) 2 + V ~ 2 = E ~ 2 (25.32) E ~ d r d t 2 + V ~ 2 = E ~ 2 {:(25.32)(( widetilde(E))(dr^(**))/(dt))^(2)+ widetilde(V)^(2)= widetilde(E)^(2):}\begin{equation*} \left(\widetilde{E} \frac{d r^{*}}{d t}\right)^{2}+\widetilde{V}^{2}=\widetilde{E}^{2} \tag{25.32} \end{equation*}(25.32)(E~drdt)2+V~2=E~2
Figure 25.3.
A cycloid gives the relation between proper time and Schwarschild r r rrr coordinate for a test particle falling straight in toward center of gravitational attraction of negligible dimensions. The angle of turn of the wheel as it rolls on the base line and generates the cycloid is denoted by η η eta\etaη. In terms of this parameter, one has
r = R 2 ( 1 + cos η ) (Schwarzschild r -coordinate) τ = R 2 ( R 2 M ) 1 / 2 ( η + sin η ) (proper time) r = R 2 ( 1 + cos η )  (Schwarzschild  r -coordinate)  τ = R 2 R 2 M 1 / 2 ( η + sin η )  (proper time)  {:[r=(R)/(2)(1+cos eta)quad" (Schwarzschild "r"-coordinate) "],[tau=(R)/(2)((R)/(2M))^(1//2)(eta+sin eta)quad" (proper time) "]:}\begin{gathered} r=\frac{R}{2}(1+\cos \eta) \quad \text { (Schwarzschild } r \text {-coordinate) } \\ \tau=\frac{R}{2}\left(\frac{R}{2 M}\right)^{1 / 2}(\eta+\sin \eta) \quad \text { (proper time) } \end{gathered}r=R2(1+cosη) (Schwarzschild r-coordinate) τ=R2(R2M)1/2(η+sinη) (proper time) 
(note difference in scale factors in expressions for r r rrr and for τ τ tau\tauτ ). The total lapse of proper time to fall from r = R r = R r=Rr=Rr=R to r = 0 r = 0 r=0r=0r=0 is τ = ( π / 2 ) ( R 3 / 2 M ) 1 / 2 τ = ( π / 2 ) R 3 / 2 M 1 / 2 tau=(pi//2)(R^(3)//2M)^(1//2)\tau=(\pi / 2)\left(R^{3} / 2 M\right)^{1 / 2}τ=(π/2)(R3/2M)1/2. The same cycloid relation and the same expression for time to fall holds in Newton's nonrelativistic theory of gravitation, except that there the symbol τ τ tau\tauτ is to be replaced by the symbol t t ttt (ordinary time). Were one dealing in Newtonian theory with the same attracting mass M M MMM spread uniformly over a sphere of radius R R RRR, with a pipe thrust through it to make a channel for the motion of the test particle, then that particle would execute simple harmonic oscillations (dotted curve above). The angular frequency ω ω omega\omegaω of these vibrations would be identical with the angular frequency of revolution of the test particle in a circle just grazing the surface of the planet, a frequency given by Kepler's law M = ω 2 R 3 M = ω 2 R 3 M=omega^(2)R^(3)M=\omega^{2} R^{3}M=ω2R3. In this case, the time to fall to the center would be ( π / 2 ) ( R 3 / M ) 1 / 2 ( π / 2 ) R 3 / M 1 / 2 (pi//2)(R^(3)//M)^(1//2)(\pi / 2)\left(R^{3} / M\right)^{1 / 2}(π/2)(R3/M)1/2, longer by a factor 2 1 / 2 2 1 / 2 2^(1//2)2^{1 / 2}21/2 than for a concentrated center of attraction (concentrated mass: stronger acceleration and higher velocity in the later phases of the fall). The expression for the Schwarzschild-coordinate time t t ttt required to reach any point r r rrr in the fall under the influence of a concentrated center of attraction is complicated and is not shown here (see equation 25.37 and Figure 25.5).
The same cycloidal relation that connects r r rrr with time for free fall of a particle also connects the radius of the "Friedmann dust-filled universe" with time (see Box 27.1), except that there the cycloid diagram applies directly, without any difference in scale between the two key variables:
( radius of 3 -sphere ) = a 2 ( 1 cos η ) a 4 η 2 ( for small η ) (  radius of  3 -sphere  ) = a 2 ( 1 cos η ) a 4 η 2 (  for small  η ) ((" radius of ")/(3"-sphere "))=(a)/(2)(1-cos eta)≃(a)/(4)eta^(2)(" for small "eta)\binom{\text { radius of }}{3 \text {-sphere }}=\frac{a}{2}(1-\cos \eta) \simeq \frac{a}{4} \eta^{2}(\text { for small } \eta)( radius of 3-sphere )=a2(1cosη)a4η2( for small η)
( coordinate time identical with proper time as measured on dust particle ) = a 2 ( η sin η ) a 12 η 3 (for small η )  coordinate time   identical with   proper time as   measured on dust   particle  = a 2 ( η sin η ) a 12 η 3  (for small  η {:([" coordinate time "],[" identical with "],[" proper time as "],[" measured on dust "],[" particle "])=(a)/(2)(eta-sin eta)≃(a)/( 12)eta^(3)" (for small "eta)\left.\left(\begin{array}{l} \text { coordinate time } \\ \text { identical with } \\ \text { proper time as } \\ \text { measured on dust } \\ \text { particle } \end{array}\right)=\frac{a}{2}(\eta-\sin \eta) \simeq \frac{a}{12} \eta^{3} \text { (for small } \eta\right)( coordinate time  identical with  proper time as  measured on dust  particle )=a2(ηsinη)a12η3 (for small η)
The starting point of η η eta\etaη is renormalized to time of start of expansion; see Lindquist and Wheeler (1957) for more on correlation between fall of particle and expansion of universe.
Here the effective potential is the same effective potential that one dealt with before,
(25.33) V ~ = [ ( 1 2 M / r ) ( 1 + L ~ 2 / r 2 ) ] 1 / 2 (25.33) V ~ = ( 1 2 M / r ) 1 + L ~ 2 / r 2 1 / 2 {:(25.33) widetilde(V)=[(1-2M//r)(1+ widetilde(L)^(2)//r^(2))]^(1//2):}\begin{equation*} \widetilde{V}=\left[(1-2 M / r)\left(1+\widetilde{L}^{2} / r^{2}\right)\right]^{1 / 2} \tag{25.33} \end{equation*}(25.33)V~=[(12M/r)(1+L~2/r2)]1/2
Moreover, the E ~ E ~ widetilde(E)\widetilde{E}E~ on the righthand side is the same E ~ E ~ widetilde(E)\widetilde{E}E~ that appeared in the earlier equation for ( d r / d τ ) 2 ( d r / d τ ) 2 (dr//d tau)^(2)(d r / d \tau)^{2}(dr/dτ)2. Therefore the turning points and the qualitative description of the motion are both the same as before. "A turning point is a turning point is
a turning point." Right? Right about turning points; wrong about the conclusion.
The story has it that Achilles never could pass the tortoise. Whenever he caught up with where it had been, it had moved ahead to a new location; and when he got there, it was still further ahead; and so on ad infinitum. Imagine the race between Achilles and the tortoise as running to the left and the expected point of passing as lying at r = 2 M r = 2 M r=2Mr=2 Mr=2M. The r r rrr-coordinate has no inhibition about passing through the value r = 2 M r = 2 M r=2Mr=2 Mr=2M. Not so r r r^(**)r^{*}r, the "tortoise coordinate." It can go arbitrarily far in the direction of minus infinity (corresponding to the infinitely many times when Achilles catches up with where the tortoise was) and still r r rrr remains outside r = 2 M r = 2 M r=2Mr=2 Mr=2M :
r / 2 M r / 2 M r//2Mr / 2 Mr/2M 1.000001 1.0001 1.01 1.278465 2 5 10 10,000
r / 2 M r / 2 M r^(**)//2Mr^{*} / 2 Mr/2M -12.8155 -8.2102 -3.5952 0 2 6.386 12.303 10 , 009.210 10 , 009.210 10,009.21010,009.21010,009.210
r//2M 1.000001 1.0001 1.01 1.278465 2 5 10 10,000 r^(**)//2M -12.8155 -8.2102 -3.5952 0 2 6.386 12.303 10,009.210| $r / 2 M$ | 1.000001 | 1.0001 | 1.01 | 1.278465 | 2 | 5 | 10 | 10,000 | | :--- | ---: | ---: | :--- | :--- | :--- | :--- | :--- | :--- | | $r^{*} / 2 M$ | -12.8155 | -8.2102 | -3.5952 | 0 | 2 | 6.386 | 12.303 | $10,009.210$ |
It follows that there is a great difference between the description of the motion in terms of the proper time τ τ tau\tauτ of a clock on the falling particle ( r r rrr goes all the way from r = R r = R r=Rr=Rr=R down to r = 0 r = 0 r=0r=0r=0 in the finite proper time of 25.29) and a description of the motion in terms of the Schwarzschild-coordinate time t t ttt appropriate for the faraway observer ( r r r^(**)r^{*}r goes all the way from r = R r = R r^(**)=R^(**)r^{*}=R^{*}r=R down to r = r = r^(**)=-oor^{*}=-\inftyr=; infinite t t ttt required for this; but even in infinite time, as r r r^(**)r^{*}r goes down to , r , r -oo,r-\infty, r,r is only brought asymptotically down to r 2 M r 2 M r∼2Mr \sim 2 Mr2M ). Thus the second description of the motion leaves out, and has no alternative but to leave out, the whole range of r r rrr values from r = 2 M r = 2 M r=2Mr=2 Mr=2M down to zero: perfectly good physics, and physics that the falling particle is going to see and explore, but physics that the faraway observer never will see and never can see. If the tortoise coordinate did not exist, it would have to be invented. It invests each factor ten of closer approach to r = 2 M r = 2 M r=2Mr=2 Mr=2M with the same interest as the last factor ten and the next to come. It proportions itself in accord with the amount of Schwarzschild-coordinate time available to the faraway observer to study these more and more microscopic amounts of motion in more and more detail.
Figure 25.4 shows the effective potential V ~ V ~ widetilde(V)\widetilde{V}V~ of (25.33) and of Figure 25.2 replotted as a function of the tortoise coordinate. The approach of V ~ V ~ widetilde(V)\widetilde{V}V~ to zero at r = 2 M r = 2 M r=2Mr=2 Mr=2M shows up as an exponential approach of V ~ V ~ widetilde(V)\widetilde{V}V~ to zero as r r r^(**)r^{*}r goes to minus infinity. Thus in moving "towards the black hole" ( r = 2 M , r = ) r = 2 M , r = (r=2M,r^(**)=-oo)\left(r=2 M, r^{*}=-\infty\right)(r=2M,r=), the particle, as described in coordinate time t t ttt, soon casts off any effective influence of any potential, and moves essentially freely toward decreasing r r r^(**)r^{*}r, in accordance with the equation
(25.34) ( E ~ d r d t ) 2 E ~ 2 (25.34) E ~ d r d t 2 E ~ 2 {:(25.34)(( widetilde(E))(dr^(**))/(dt))^(2)≃ widetilde(E)^(2):}\begin{equation*} \left(\widetilde{E} \frac{d r^{*}}{d t}\right)^{2} \simeq \widetilde{E}^{2} \tag{25.34} \end{equation*}(25.34)(E~drdt)2E~2
that is, "with the speed of light" ( d r / d t 1 ) d r / d t 1 (dr^(**)//dt≃-1)\left(d r^{*} / d t \simeq-1\right)(dr/dt1). This dependence of r r r^(**)r^{*}r on t t ttt implies at once an asymptotic dependence of r r rrr itself on Schwarzschild-coordinate time t t ttt, of the form
(25.35) r = 2 M + ( constant × e t / 2 M ) (25.35) r = 2 M +  constant  × e t / 2 M {:(25.35)r=2M+(" constant "xxe^(-t//2M)):}\begin{equation*} r=2 M+\left(\text { constant } \times e^{-t / 2 M}\right) \tag{25.35} \end{equation*}(25.35)r=2M+( constant ×et/2M)
This result is independent of the angular momentum of the particle and independent also of the energy, provided only that the energy-per-unit-mass E ~ E ~ widetilde(E)\widetilde{E}E~ is enough to
(3) details of the approach to the Schwarzschild radius ( r = 2 M r = 2 M r=2Mr=2 Mr=2M )
Figure 25.4.
Effective potential for motion in Schwarzschild geometry, expressed as a function of the tortoise coordinate, for selected values of the angular momentum of the test particle. The angular momentum L L LLL is expressed in units M μ M μ M muM \muMμ, where M M MMM is the mass of the black hole and μ μ mu\muμ the mass of the test particle. The effective potential (including rest mass) is expressed in units μ μ mu\muμ; thus, V ~ = V / μ V ~ = V / μ widetilde(V)=V//mu\widetilde{V}=V / \muV~=V/μ. The tortoise coordinate r = r + 2 M ln ( r / 2 M 1 ) r = r + 2 M ln ( r / 2 M 1 ) r^(**)=r+2M ln(r//2M-1)r^{*}=r+2 M \ln (r / 2 M-1)r=r+2Mln(r/2M1) is given in units M M MMM.
surmount the barrier (Figure 25.4) of the effective potential-per-unit-mass V ~ V ~ widetilde(V)\widetilde{V}V~. (More will be said on the approach to r = 2 M r = 2 M r=2Mr=2 Mr=2M in Chapter 32, on gravitational collapse.)
To replace the asymptotic formula (25.35) by a complete formula requires one to integrate (25.32); thus,
(25.36) t = d t = E ~ d r [ E ~ 2 V ~ 2 ] 1 / 2 = E ~ [ E ~ 2 ( 1 2 M / r ) ( 1 + L ~ 2 / r 2 ) ] 1 / 2 d r ( 1 2 M / r ) . (25.36) t = d t = E ~ d r E ~ 2 V ~ 2 1 / 2 = E ~ E ~ 2 ( 1 2 M / r ) 1 + L ~ 2 / r 2 1 / 2 d r ( 1 2 M / r ) . {:[(25.36)t=int dt=int(( widetilde(E))dr^(**))/([ widetilde(E)^(2)- widetilde(V)^(2)]^(1//2))],[=int(( widetilde(E)))/([ widetilde(E)^(2)-(1-2M//r)(1+ widetilde(L)^(2)//r^(2))]^(1//2))(dr)/((1-2M//r)).]:}\begin{align*} t=\int d t & =\int \frac{\widetilde{E} d r^{*}}{\left[\widetilde{E}^{2}-\widetilde{V}^{2}\right]^{1 / 2}} \tag{25.36}\\ & =\int \frac{\widetilde{E}}{\left[\widetilde{E}^{2}-(1-2 M / r)\left(1+\widetilde{L}^{2} / r^{2}\right)\right]^{1 / 2}} \frac{d r}{(1-2 M / r)} . \end{align*}(25.36)t=dt=E~dr[E~2V~2]1/2=E~[E~2(12M/r)(1+L~2/r2)]1/2dr(12M/r).
The integration here is not easy, even for pure radial motion ( L ~ = 0 L ~ = 0 widetilde(L)=0\widetilde{L}=0L~=0 ), as is seen in the complication of the resulting expression (Khuri 1957):
(25.37) t = [ ( R 2 + 2 M ) ( R 2 M 1 ) 1 / 2 ] η + R 2 ( R 2 M 1 ) 1 / 2 sin η + 2 M ln | ( R / 2 M 1 ) 1 / 2 + tan ( η / 2 ) ( R / 2 M 1 ) 1 / 2 tan ( η / 2 ) | (25.37) t = [ R 2 + 2 M R 2 M 1 1 / 2 η + R 2 R 2 M 1 1 / 2 sin η + 2 M ln ( R / 2 M 1 ) 1 / 2 + tan ( η / 2 ) ( R / 2 M 1 ) 1 / 2 tan ( η / 2 ) {:[(25.37)t=[{:((R)/(2)+2M)((R)/(2M)-1)^(1//2)]eta+(R)/(2)((R)/(2M)-1)^(1//2)sin eta],[+2M ln|((R//2M-1)^(1//2)+tan(eta//2))/((R//2M-1)^(1//2)-tan(eta//2))|]:}\begin{align*} t=[ & \left.\left(\frac{R}{2}+2 M\right)\left(\frac{R}{2 M}-1\right)^{1 / 2}\right] \eta+\frac{R}{2}\left(\frac{R}{2 M}-1\right)^{1 / 2} \sin \eta \tag{25.37}\\ & +2 M \ln \left|\frac{(R / 2 M-1)^{1 / 2}+\tan (\eta / 2)}{(R / 2 M-1)^{1 / 2}-\tan (\eta / 2)}\right| \end{align*}(25.37)t=[(R2+2M)(R2M1)1/2]η+R2(R2M1)1/2sinη+2Mln|(R/2M1)1/2+tan(η/2)(R/2M1)1/2tan(η/2)|
Here η η eta\etaη is the same cycloid parameter that appears in equation (25.28) and Figure 25.3 (see the detailed plot in Figure 25.5 of the correlation between r r rrr and t t ttt, illustrat-
Figure 25.5.
Fall toward a Schwarzschild black hole as described (a) by a comoving observer (proper time τ τ tau\tauτ ) and (b) by a faraway observer (Schwarzschild-coordinate time t t ttt ). In the one description, the point r = 0 r = 0 r=0r=0r=0 is attained, and quickly [see equation (25.28)]. In the other description, r = 0 r = 0 r=0r=0r=0 is never reached and even r = 2 M r = 2 M r=2Mr=2 Mr=2M is attained only asymptotically [equations (25.35) and (25.37)]. The qualitative features of the motion in both cases are most easily deduced by inspection of the "effective potential-per-unit-mass" V ~ V ~ widetilde(V)\widetilde{V}V~ in its dependence on r r rrr (Figure 25.2) when one is interested in proper time; or the same effective potential V ^ V ^ widehat(V)\widehat{V}V^ in its dependence on the "tortoise coordinate" r r r^(**)r^{*}r [Figure 25.4 and equation (25.31)] when one is interested in Schwarzschild-coordinate time t t ttt.
ing the asymptotic approach to r = 2 M r = 2 M r=2Mr=2 Mr=2M ). The difficulty in the integration for t t ttt, as compared to the ease of the integration for τ τ tau\tauτ (25.28), has a simple origin. Only two r r rrr-values appear in (25.27a) as special points when L ~ L ~ widetilde(L)\widetilde{L}L~ is zero: the starting point, r = R r = R r=Rr=Rr=R, where the velocity vanishes, and the point r = 0 r = 0 r=0r=0r=0, where d r / d τ d r / d τ dr//d taud r / d \taudr/dτ becomes infinite. In contrast (25.36), rewritten as
(25.36') t = d t = [ 1 2 M / R ] 1 / 2 [ 2 M / r 2 M / R ] 1 / 2 d r ( 1 2 M / r ) , (25.36') t = d t = [ 1 2 M / R ] 1 / 2 [ 2 M / r 2 M / R ] 1 / 2 d r ( 1 2 M / r ) , {:(25.36')t=int dt=int([1-2M//R]^(1//2))/([2M//r-2M//R]^(1//2))(dr)/((1-2M//r))",":}\begin{equation*} t=\int d t=\int \frac{[1-2 M / R]^{1 / 2}}{[2 M / r-2 M / R]^{1 / 2}} \frac{d r}{(1-2 M / r)}, \tag{25.36'} \end{equation*}(25.36')t=dt=[12M/R]1/2[2M/r2M/R]1/2dr(12M/r),
contains three special points: r = R , r = 0 r = R , r = 0 r=R,r=0r=R, r=0r=R,r=0, and the added point with all the new physics, r = 2 M r = 2 M r=2Mr=2 Mr=2M. To admit angular momentum is to increase the number of special points still further, and to make the integral unmanageable except numerically or qualitatively (via the potential diagram of Figure 25.4), or in terms of elliptic functions [Hagihara (1931)].
It is often convenient to abstract away from the precise value r = R r = R r=Rr=Rr=R at the start of the collapse. In this event, one deals with the limit R R R longrightarrow ooR \longrightarrow \inftyR. Then it is convenient to displace the zero of proper time to the instant of final catastrophe. In this limit, one has
τ / 2 M = ( 2 / 3 ) ( r / 2 M ) 3 / 2 , (25.38) t / 2 M = ( 2 / 3 ) ( r / 2 M ) 3 / 2 2 ( r / 2 M ) 1 / 2 + ln ( r / 2 M ) 1 / 2 + 1 ( r / 2 M ) 1 / 2 1 τ / 2 M = ( 2 / 3 ) ( r / 2 M ) 3 / 2 , (25.38) t / 2 M = ( 2 / 3 ) ( r / 2 M ) 3 / 2 2 ( r / 2 M ) 1 / 2 + ln ( r / 2 M ) 1 / 2 + 1 ( r / 2 M ) 1 / 2 1 {:[tau//2M=-(2//3)(r//2M)^(3//2)","],[(25.38)t//2M=-(2//3)(r//2M)^(3//2)-2(r//2M)^(1//2)+ln(((r//2M)^(1//2)+1)/((r//2M)^(1//2)-1))]:}\begin{gather*} \tau / 2 M=-(2 / 3)(r / 2 M)^{3 / 2}, \\ t / 2 M=-(2 / 3)(r / 2 M)^{3 / 2}-2(r / 2 M)^{1 / 2}+\ln \frac{(r / 2 M)^{1 / 2}+1}{(r / 2 M)^{1 / 2}-1} \tag{25.38} \end{gather*}τ/2M=(2/3)(r/2M)3/2,(25.38)t/2M=(2/3)(r/2M)3/22(r/2M)1/2+ln(r/2M)1/2+1(r/2M)1/21
At very large negative time, the particle is far away and approaching only very slowly. Then one can write
(25.39a) r = ( 9 M τ 2 / 2 ) 1 / 3 ( 9 M t 2 / 2 ) 1 / 3 (25.39a) r = 9 M τ 2 / 2 1 / 3 9 M t 2 / 2 1 / 3 {:(25.39a)r=(9Mtau^(2)//2)^(1//3)≃(9Mt^(2)//2)^(1//3):}\begin{equation*} r=\left(9 M \tau^{2} / 2\right)^{1 / 3} \simeq\left(9 M t^{2} / 2\right)^{1 / 3} \tag{25.39a} \end{equation*}(25.39a)r=(9Mτ2/2)1/3(9Mt2/2)1/3
whether one refers to coordinate time or to proper time. However, the final stages of infall are again very different, when expressed in terms of proper time ( τ 0 τ 0 tau longrightarrow0\tau \longrightarrow 0τ0, r 0 r 0 r longrightarrow0r \longrightarrow 0r0 ), from what they are as expressed in terms of Schwarzschild-coordinate time,
(25.39b) r / 2 M = 1 + 4 e 8 / 3 e t / 2 M (25.39b) r / 2 M = 1 + 4 e 8 / 3 e t / 2 M {:(25.39b)r//2M=1+4e^(-8//3)e^(-t//2M):}\begin{equation*} r / 2 M=1+4 e^{-8 / 3} e^{-t / 2 M} \tag{25.39b} \end{equation*}(25.39b)r/2M=1+4e8/3et/2M
Nonradial orbits:
(1) Fourier analysis
(2) details of angular motion
Turning from pure radial motion to motion endowed with angular momentum, one has a situation where one would like to express the principal quantities of the motion (components of displacement, velocity, and acceleration) in Fourier series (in Schwarzschild-coordinate time), these being so convenient in the Newtonian limit in analyzing radiation and perturbations of one orbit by another and tidal perturbations of the moving particle itself by the tide-producing action of the center of attraction. Any exact evaluation of these coefficients would appear difficult. For the time being, the values of the Fourier amplitudes would seem best developed by successive approximations starting from the Newtonian analysis (see Box 25.4 and references cited there).
In connection with any such Fourier analysis, it is appropriate to recall that the fundamental frequency alone appears, and all higher harmonics have zero amplitude, when the motion takes place in an exactly circular orbit (opposite extreme from the pure radial motion of L ~ = 0 L ~ = 0 widetilde(L)=0\widetilde{L}=0L~=0 ). Therefore it is of interest to note (exercise 25.19) that the circular frequency ω ω omega\omegaω of this motion, as measured by a faraway observer, is correlated with the Schwarzschild r r rrr-value of the orbit by exactly the Keplerian formula of non-relativistic physics:
(25.40) ω 2 r 3 = M (exact; general relativity ) . (25.40) ω 2 r 3 = M  (exact; general relativity  . {:(25.40){:omega^(2)r^(3)=M quad" (exact; general relativity ").:}\begin{equation*} \left.\omega^{2} r^{3}=M \quad \text { (exact; general relativity }\right) . \tag{25.40} \end{equation*}(25.40)ω2r3=M (exact; general relativity ).
Turn now from the correlation between r r rrr and time to the correlation between r r rrr and angle of revolution ( ϕ ϕ phi\phiϕ in the analysis here; θ θ theta\thetaθ in the Hamilton-Jacobi analysis of Box 25.4 ; this difference in name is irrelevant in what follows). Return to equation (25.16),
( d r d τ ) 2 + V ~ 2 ( r ) = E ~ 2 d r d τ 2 + V ~ 2 ( r ) = E ~ 2 ((dr)/(d tau))^(2)+ widetilde(V)^(2)(r)= widetilde(E)^(2)\left(\frac{d r}{d \tau}\right)^{2}+\widetilde{V}^{2}(r)=\widetilde{E}^{2}(drdτ)2+V~2(r)=E~2
and recall also equation (25.17)
d ϕ d τ = L ~ r 2 . d ϕ d τ = L ~ r 2 . (d phi)/(d tau)=(( widetilde(L)))/(r^(2)).\frac{d \phi}{d \tau}=\frac{\widetilde{L}}{r^{2}} .dϕdτ=L~r2.
Solve the second equation for d τ d τ d taud \taudτ, and substitute into the first to find
(25.41) ( L ~ d r r 2 d ϕ ˙ ) 2 + V ~ 2 ( r ) = E ~ 2 (25.41) L ~ d r r 2 d ϕ ˙ 2 + V ~ 2 ( r ) = E ~ 2 {:(25.41)((( widetilde(L))dr)/(r^(2)d(phi^(˙))))^(2)+ widetilde(V)^(2)(r)= widetilde(E)^(2):}\begin{equation*} \left(\frac{\widetilde{L} d r}{r^{2} d \dot{\phi}}\right)^{2}+\widetilde{V}^{2}(r)=\widetilde{E}^{2} \tag{25.41} \end{equation*}(25.41)(L~drr2dϕ˙)2+V~2(r)=E~2
or equivalently, with u = M / r u = M / r u=M//ru=M / ru=M/r and L = L ~ / M = L / M μ L = L ~ / M = L / M μ L^(†)= widetilde(L)//M=L//M muL^{\dagger}=\widetilde{L} / M=L / M \muL=L~/M=L/Mμ,
(25.42) ( d u d ϕ ) 2 = E ~ 2 ( 1 2 u ) ( 1 + L 2 u 2 ) L 2 . (25.42) d u d ϕ 2 = E ~ 2 ( 1 2 u ) 1 + L 2 u 2 L 2 . {:(25.42)((du)/(d phi))^(2)=( widetilde(E)^(2)-(1-2u)(1+L^(†2)u^(2)))/(L^(†2)).:}\begin{equation*} \left(\frac{d u}{d \phi}\right)^{2}=\frac{\widetilde{E}^{2}-(1-2 u)\left(1+L^{\dagger 2} u^{2}\right)}{L^{\dagger 2}} . \tag{25.42} \end{equation*}(25.42)(dudϕ)2=E~2(12u)(1+L2u2)L2.
Exercise 25.16 presents an alternative differential equation derived from this formula, and uses it to obtain the following expression for the angle swept out by the particle or planet, moving in a nearly circular orbit, between two successive points of closest approach:
(25.43) Δ ϕ = 2 π ( 1 6 M / r 0 ) 1 / 2 (25.43) Δ ϕ = 2 π 1 6 M / r 0 1 / 2 {:(25.43)Delta phi=(2pi)/((1-6M//r_(0))^(1//2)):}\begin{equation*} \Delta \phi=\frac{2 \pi}{\left(1-6 M / r_{0}\right)^{1 / 2}} \tag{25.43} \end{equation*}(25.43)Δϕ=2π(16M/r0)1/2
The radial motion turns around from ingoing to outgoing, or from outgoing to ingoing, whenever the quantity E ~ 2 V ~ 2 ( r ) E ~ 2 V ~ 2 ( r ) widetilde(E)^(2)- widetilde(V)^(2)(r)\widetilde{E}^{2}-\widetilde{V}^{2}(r)E~2V~2(r), or E ~ V ~ ( r ) E ~ V ~ ( r ) widetilde(E)- widetilde(V)(r)\widetilde{E}-\widetilde{V}(r)E~V~(r), plotted as a function of r r rrr, undergoes a change of sign, and this as clearly here in the correlation between r r rrr and ϕ ϕ phi\phiϕ as in the earlier correlation between r r rrr and time. Recall again the curves of Figure 25.2 for V ~ ( r ) V ~ ( r ) widetilde(V)(r)\widetilde{V}(r)V~(r) as a function of r r rrr for selected L ~ L ~ widetilde(L)\widetilde{L}L~ values. From them one can read out, without any calculation at all, the principal features of typical orbits (Box 25.6) obtained by detailed numerical calculation. Characteristic features are
(1) circular orbit when E ~ E ~ widetilde(E)\widetilde{E}E~ coincides with a minimum of the effective potential V ~ ( r ) V ~ ( r ) widetilde(V)(r)\widetilde{V}(r)V~(r),
(2) precession when E ~ E ~ widetilde(E)\widetilde{E}E~ is a little more than V ~ min V ~ min  widetilde(V)_("min ")\widetilde{V}_{\text {min }}V~min ,
(3) temporary "orbiting" (many turns around the center of attraction) when E ~ E ~ widetilde(E)\widetilde{E}E~ is close to a maximum V ~ max V ~ max  widetilde(V)_("max ")\widetilde{V}_{\text {max }}V~max  of the effective potential,
(4) "capture into the black hole" when E ~ E ~ widetilde(E)\widetilde{E}E~ exceeds V ~ max V ~ max  widetilde(V)_("max ")\widetilde{V}_{\text {max }}V~max .
A more detailed analysis appears in Box 25.6. [For explicit analytic calculation of orbits in the Schwarzschild geometry, see Hagihara (1931), Darwin (1959 and 1961), and Mielnik and Plebanski (1962).]
For orbits of positive energy, no feature of the inverse-square force is better known than the Rutherford scattering formula. It gives the "effective amount of target area" presented by the center of attraction for throwing particles into a faraway receptor that picks up everything coming off into a unit solid angle at a specified angle of deflection Θ Θ Theta\ThetaΘ :
(25.44) d σ d Ω = M 2 [ 4 ( E ~ 1 ) sin 2 Θ / 2 ] 2 (Rutherford; nonrelativistic) (25.44) d σ d Ω = M 2 4 ( E ~ 1 ) sin 2 Θ / 2 2  (Rutherford; nonrelativistic)  {:(25.44)(d sigma)/(d Omega)=(M^(2))/([4(( widetilde(E))-1)sin^(2)Theta//2]^(2))" (Rutherford; nonrelativistic) ":}\begin{equation*} \frac{d \sigma}{d \Omega}=\frac{M^{2}}{\left[4(\widetilde{E}-1) \sin ^{2} \Theta / 2\right]^{2}} \text { (Rutherford; nonrelativistic) } \tag{25.44} \end{equation*}(25.44)dσdΩ=M2[4(E~1)sin2Θ/2]2 (Rutherford; nonrelativistic) 
(derivation in equations 8 to 15 of Box 25.4). When one turns from the Newtonian analysis to the general-relativity treatment, one finds two striking new features of the scattering associated with the phenomenon of orbiting. (1) The particles that come off at a given angle of deflection Θ Θ Theta\ThetaΘ now include not only those that have really been deflected by Θ Θ Theta\ThetaΘ (the only contribution in Rutherford scattering), but also those that have been deflected by Θ + 2 π , Θ + 4 π , Θ + 2 π , Θ + 4 π , Theta+2pi,Theta+4pi,dots\Theta+2 \pi, \Theta+4 \pi, \ldotsΘ+2π,Θ+4π, etc. (an infinite series of contributions). (2) These supplementary contributions, while finite in amount, and even finite in amount "per unit range of Θ Θ Theta\ThetaΘ," are not finite in amount when expressed "per unit of solid angle d Ω = 2 π sin Θ d Θ d Ω = 2 π sin Θ d Θ d Omega=2pi sin Theta dTheta^('')d \Omega=2 \pi \sin \Theta d \Theta^{\prime \prime}dΩ=2πsinΘdΘ in either the forward direction ( Θ = 0 ) ( Θ = 0 ) (Theta=0)(\Theta=0)(Θ=0) or the backward direction ( Θ = π ) ( Θ = π ) (Theta=pi)(\Theta=\pi)(Θ=π). This circumstance produces no spectacular change in the forward scattering, for that is already infinite in the nonrelativistic approximation (infinity in Rutherford value of d σ / d Ω d σ / d Ω d sigma//d Omegad \sigma / d \Omegadσ/dΩ as Θ = 0 Θ = 0 Theta=0\Theta=0Θ=0 is approached, arising from
(3) nearly circular orbits: periastron shift
(4) qualitative features of angular motion
Scattering of incoming particles:
(1) Rutherford (nonrelativistic) cross section
(2) new features due to relativistic gravity
particles flying past with large impact parameters and experiencing small deflections; see exercise 25.21 ). In contrast, the backward scattering, which was perfectly finite in the Rutherford analysis, acquires also an infinity:
(25.45) ( d σ d Ω ) θ π constant sin Θ . (25.45) d σ d Ω θ π  constant  sin Θ . {:(25.45)((d sigma)/(d Omega))_(theta∼pi)∼(" constant ")/(sin Theta).:}\begin{equation*} \left(\frac{d \sigma}{d \Omega}\right)_{\theta \sim \pi} \sim \frac{\text { constant }}{\sin \Theta} . \tag{25.45} \end{equation*}(25.45)(dσdΩ)θπ constant sinΘ.
This concentration of scattering in the backward direction is known as a "glory." The effect is most readily seen by looking at the brilliant illumination that surrounds the shadow of one's plane on clouds far below ( 180 180 180^(@)180^{\circ}180 scattering of light ray within waterdrop). It is also clearly seen in observations on the scattering of atoms by atoms near Θ = 180 Θ = 180 Theta=180^(@)\Theta=180^{\circ}Θ=180. No dwarf star, not even any neutron star, is sufficiently compact to be out of the way of a high-speed particle trying to make such a 180 180 180^(@)180^{\circ}180 turn. Only a black hole is compact enough to produce this effect.
Further interesting features of motion in Schwarzschild geometry appear in the exercises below.

EXERCISES

Exercise 25.13. QUALITATIVE FORMS OF PARTICLE ORBITS

Verify the statements about particle orbits made in part C of Box 25.6 .

Exercise 25.14. IMPACT PARAMETER

For a scattering orbit (i.e., unbound orbit), show that L ~ = E ~ v b L ~ = E ~ v b widetilde(L)= widetilde(E)v_(oo)b\widetilde{L}=\widetilde{E} v_{\infty} bL~=E~vb, where b b bbb is the impact parameter and v v v_(oo)v_{\infty}v the asymptotic ordinary velocity; also show that
(25.46) b = L ~ / ( E ~ 2 1 ) 1 / 2 (25.46) b = L ~ / E ~ 2 1 1 / 2 {:(25.46)b= widetilde(L)//( widetilde(E)^(2)-1)^(1//2):}\begin{equation*} b=\widetilde{L} /\left(\widetilde{E}^{2}-1\right)^{1 / 2} \tag{25.46} \end{equation*}(25.46)b=L~/(E~21)1/2
Draw a picture illustrating the physical significance of the impact parameter.
Exercise 25.15. TIME TO FALL TO r = 2 M r = 2 M r=2Mr=2 Mr=2M
Show from equation (25.16) and the first picture in Box 25.6 that orbits (general L ~ L ~ widetilde(L)\widetilde{L}L~ value!) which approach r = 2 M r = 2 M r=2Mr=2 Mr=2M do so in a finite proper time, but (equation 25.32) an infinite coordinate time t t ttt. For equilibrium stars, which must have radii R > 2 M R > 2 M R > 2MR>2 MR>2M, the coordinate time t t ttt to fall to the surface is finite, of course.
Exercise 25.16. PERIASTRON SHIFT FOR NEARLY CIRCULAR ORBITS
Rewrite equation (25.42) in the form
(25.47) ( d u / d ϕ ) 2 + ( 1 6 u 0 ) ( u u 0 ) 2 2 ( u u 0 ) 3 = ( E ~ 2 E ~ 0 2 ) / L 2 (25.47) ( d u / d ϕ ) 2 + 1 6 u 0 u u 0 2 2 u u 0 3 = E ~ 2 E ~ 0 2 / L 2 {:(25.47)(du//d phi)^(2)+(1-6u_(0))(u-u_(0))^(2)-2(u-u_(0))^(3)=( widetilde(E)^(2)- widetilde(E)_(0)^(2))//L^(†2):}\begin{equation*} (d u / d \phi)^{2}+\left(1-6 u_{0}\right)\left(u-u_{0}\right)^{2}-2\left(u-u_{0}\right)^{3}=\left(\widetilde{E}^{2}-\widetilde{E}_{0}^{2}\right) / L^{\dagger 2} \tag{25.47} \end{equation*}(25.47)(du/dϕ)2+(16u0)(uu0)22(uu0)3=(E~2E~02)/L2
Express the constant u 0 M / r 0 u 0 M / r 0 u_(0)-=M//r_(0)u_{0} \equiv M / r_{0}u0M/r0 in terms of L ~ / M L ~ / M widetilde(L)//M\widetilde{L} / ML~/M, and express E ~ 0 E ~ 0 widetilde(E)_(0)\widetilde{E}_{0}E~0 in terms of u 0 u 0 u_(0)u_{0}u0. Show for a nearly circular orbit of radius r 0 r 0 r_(0)r_{0}r0 that the angle swept out between two successive periastra (points of closest approach to the star) is
(25.48) Δ ϕ = 2 π ( 1 6 M / r 0 ) 1 / 2 (25.48) Δ ϕ = 2 π 1 6 M / r 0 1 / 2 {:(25.48)Delta phi=2pi(1-6M//r_(0))^(-1//2):}\begin{equation*} \Delta \phi=2 \pi\left(1-6 M / r_{0}\right)^{-1 / 2} \tag{25.48} \end{equation*}(25.48)Δϕ=2π(16M/r0)1/2
Sketch the shape of the orbit for r 0 = 8 M r 0 = 8 M r_(0)=8Mr_{0}=8 Mr0=8M.

Exercise 25.17. ANGULAR MOTION DURING INFALL

From equation (25.42), deduce that the total angle Δ ϕ Δ ϕ Delta phi\Delta \phiΔϕ swept out on a trajectory falling into r = 0 r = 0 r=0r=0r=0 is finite. The computation is straightforward; but the interpretation, in view of the behavior of t ( λ ) t ( λ ) t(lambda)t(\lambda)t(λ) on the same trajectory (equation 25.32 and exercise 25.15 ), is not. The interpretation will be elucidated in Chapter 31.

Exercise 25.18. MAXIMUM AND MINIMUM OF EFFECTIVE POTENTIAL

Derive the expressions given in the caption of Figure 25.2 for the locations of the maximum and the minimum of the effective potential as a function of angular momentum. Determine also the limiting form of the dependence of barrier height on angular momentum in the limit in which L ~ L ~ widetilde(L)\widetilde{L}L~ is very large compared to M M MMM.

Exercise 25.19. KEPLER LAW VALID FOR CIRCULAR ORBITS

From d ϕ / d τ d ϕ / d τ d phi//d taud \phi / d \taudϕ/dτ of (25.17) and d t / d τ d t / d τ dt//d taud t / d \taudt/dτ of (25.18), deduce an expression for the circular frequency of revolution as seen by a faraway observer; and from the results of exercise 25.18 (or otherwise) show that it fulfills exactly the Kepler relation
ω 2 r 3 = M ω 2 r 3 = M omega^(2)r^(3)=M\omega^{2} r^{3}=Mω2r3=M
for any circular orbit of Schwarzschild r r rrr-value equal to r r rrr, whether stable (potential minimum) or unstable (potential maximum).

Exercise 25.20. HAMILTON-JACOBI FUNCTION

Construct the locus in the r , θ r , θ r,thetar, \thetar,θ diagram of points of constant dynamic phase S ~ ( t , r , θ ) = 0 S ~ ( t , r , θ ) = 0 widetilde(S)(t,r,theta)=0\widetilde{S}(t, r, \theta)=0S~(t,r,θ)=0 for t = 0 t = 0 t=0t=0t=0 and for values L ~ = 4 M , E ~ = 1 L ~ = 4 M , E ~ = 1 widetilde(L)=4M, widetilde(E)=1\widetilde{L}=4 M, \widetilde{E}=1L~=4M,E~=1 (or for L ~ = 2 3 M , E ~ = ( 8 / 9 ) 1 / 2 L ~ = 2 3 M , E ~ = ( 8 / 9 ) 1 / 2 widetilde(L)=2sqrt3M, widetilde(E)=(8//9)^(1//2)\widetilde{L}=2 \sqrt{3} M, \widetilde{E}=(8 / 9)^{1 / 2}L~=23M,E~=(8/9)1/2, or for some other equally simple set of values for these two parameters). Show that the whole set of surfaces of constant S ~ S ~ widetilde(S)\widetilde{S}S~ can be obtained by rotating the foregoing locus through one angle, then another and another, and recopying or retracing. Interpret physically the principal features of the resulting pattern of curves.

Exercise 25.21. DEFLECTION BY GRAVITY CONTRASTED WITH DEFLECTION BY ELECTRIC FORCE

A test particle of arbitrary velocity β β beta\betaβ flies past a mass M M MMM at an impact parameter b b bbb so great that the deflection is small. Show that the deflection is
(25.49) θ = 2 M b β 2 ( 1 + β 2 ) (25.49) θ = 2 M b β 2 1 + β 2 {:(25.49)theta=(2M)/(bbeta^(2))(1+beta^(2)):}\begin{equation*} \theta=\frac{2 M}{b \beta^{2}}\left(1+\beta^{2}\right) \tag{25.49} \end{equation*}(25.49)θ=2Mbβ2(1+β2)
Derive the deflection according to Newtonian mechanics for a particle moving with the speed of light. Show that (25.49) in the limit β 1 β 1 beta longrightarrow1\beta \longrightarrow 1β1 is twice the Newtonian deflection. Derive also (flat-space analysis) the contrasting formula for the deflection of a fast particle of rest mass μ μ mu\muμ and charge e e eee by a nucleus of charge Z e Z e ZeZ eZe,
(25.50) θ = 2 Z e 2 μ b β 2 ( 1 β 2 ) 1 / 2 (25.50) θ = 2 Z e 2 μ b β 2 1 β 2 1 / 2 {:(25.50)theta=(2Ze^(2))/(mu bbeta^(2))(1-beta^(2))^(1//2):}\begin{equation*} \theta=\frac{2 Z e^{2}}{\mu b \beta^{2}}\left(1-\beta^{2}\right)^{1 / 2} \tag{25.50} \end{equation*}(25.50)θ=2Ze2μbβ2(1β2)1/2
How feasible is it to rule out a "vector" theory of gravitation [see, for example, Brillouin (1970)], patterned after electromagnetism, by observations on the bending of light by the sun? [Hint: To simplify the mathematical analysis, go back to (25.42). Differentiate once with respect to ϕ ϕ phi\phiϕ to convert into a second-order equation. Rearrange to put on the left all those terms that would be there in the absence of gravity, and on the right all those that originate from the 2 u 2 u -2u-2 u2u term (gravitation) in the factor ( 1 2 u ) ( 1 2 u ) (1-2u)(1-2 u)(12u). Neglect the right-hand side of the equation and solve exactly (straight-line motion). Evaluate the perturbing term
on the right as a function of ϕ ϕ phi\phiϕ by inserting in it the unperturbed expression for u ( ϕ ) u ( ϕ ) u(phi)u(\phi)u(ϕ). Solve again and get the deflection.]

Exercise 25.22. CAPTURE BY A BLACK HOLE

Over and above any scattering of particles by a black hole, there is direct capture into the black hole. Show that the cross section for capture is π b crit 2 π b crit  2 pib_("crit ")^(2)\pi b_{\text {crit }}^{2}πbcrit 2, with the critical impact parameter b crit b crit  b_("crit ")b_{\text {crit }}bcrit  given by L crit / ( E 2 μ 2 ) 1 / 2 L crit  / E 2 μ 2 1 / 2 L_("crit ")//(E^(2)-mu^(2))^(1//2)L_{\text {crit }} /\left(E^{2}-\mu^{2}\right)^{1 / 2}Lcrit /(E2μ2)1/2. From the formulas in the caption of Fig. 25.2 or otherwise, show that for high-energy particles this cross section varies with energy as
(25.51) σ capt = 27 π M 2 ( 1 + 2 3 E ~ 2 + ) (25.51) σ capt  = 27 π M 2 1 + 2 3 E ~ 2 + {:(25.51)sigma_("capt ")=27 piM^(2)(1+(2)/(3 widetilde(E)^(2))+cdots):}\begin{equation*} \sigma_{\text {capt }}=27 \pi M^{2}\left(1+\frac{2}{3 \widetilde{E}^{2}}+\cdots\right) \tag{25.51} \end{equation*}(25.51)σcapt =27πM2(1+23E~2+)
(photon limit for E ~ E ~ widetilde(E)longrightarrow oo\widetilde{E} \longrightarrow \inftyE~ ) and for low energies as
(25.52) σ capt = 16 π M 2 / β 2 (25.52) σ capt  = 16 π M 2 / β 2 {:(25.52)sigma_("capt ")=16 piM^(2)//beta^(2):}\begin{equation*} \sigma_{\text {capt }}=16 \pi M^{2} / \beta^{2} \tag{25.52} \end{equation*}(25.52)σcapt =16πM2/β2
where β β beta\betaβ is the velocity relative to the velocity of light [Bogorodsky (1962)].

§25.6. ORBIT OF A PHOTON, NEUTRINO, OR GRAVITON IN SCHWARZSCHILD GEOMETRY

The concepts of "energy per unit of rest mass" and "angular momentum per unit of rest mass" make no sense for an object of zero rest mass (photon, neutrino, even the graviton of exercise 35.16). However, there is nothing about the motion of such an entity that cannot be discovered by considering the motion of a particle of finite rest mass μ μ mu\muμ and going to the limit μ 0 μ 0 mu longrightarrow0\mu \longrightarrow 0μ0. In this limit the quantities
E ~ = E / μ E ~ = E / μ widetilde(E)=E//mu\widetilde{E}=E / \muE~=E/μ
and
L ~ = L / μ L ~ = L / μ widetilde(L)=L//mu\widetilde{L}=L / \muL~=L/μ
individually go to infinity; but the ratio
(25.53) ( impact para- meter ) = b = ( angular momentum ) ( linear momentum ) = L ( E 2 μ 2 ) 1 / 2 = L ~ ( E ~ 2 1 ) 1 / 2 (25.53) (  impact para-   meter  ) = b = (  angular   momentum  ) (  linear   momentum  ) = L E 2 μ 2 1 / 2 = L ~ E ~ 2 1 1 / 2 {:(25.53)((" impact para- ")/(" meter "))=b=(((" angular ")/(" momentum ")))/(((" linear ")/(" momentum ")))=(L)/((E^(2)-mu^(2))^(1//2))=(( widetilde(L)))/(( widetilde(E)^(2)-1)^(1//2)):}\begin{equation*} \binom{\text { impact para- }}{\text { meter }}=b=\frac{\binom{\text { angular }}{\text { momentum }}}{\binom{\text { linear }}{\text { momentum }}}=\frac{L}{\left(E^{2}-\mu^{2}\right)^{1 / 2}}=\frac{\widetilde{L}}{\left(\widetilde{E}^{2}-1\right)^{1 / 2}} \tag{25.53} \end{equation*}(25.53)( impact para-  meter )=b=( angular  momentum )( linear  momentum )=L(E2μ2)1/2=L~(E~21)1/2
goes to the finite value
(25.54) Lim μ 0 L ~ E ~ = b (25.54) Lim μ 0 L ~ E ~ = b {:(25.54)Lim_(mu rarr0)(( widetilde(L)))/(( widetilde(E)))=b:}\begin{equation*} \operatorname{Lim}_{\mu \rightarrow 0} \frac{\widetilde{L}}{\widetilde{E}}=b \tag{25.54} \end{equation*}(25.54)Limμ0L~E~=b
In this limit, equation ( 25.41 ) ( 25.41 ) (25.41)(25.41)(25.41) for the shape of the orbit reduces at once to the simple form
(25.55) ( 1 r 2 d r d ϕ ) 2 + 1 2 M / r r 2 = 1 b 2 (25.55) 1 r 2 d r d ϕ 2 + 1 2 M / r r 2 = 1 b 2 {:(25.55)((1)/(r^(2))(dr)/(d phi))^(2)+(1-2M//r)/(r^(2))=(1)/(b^(2)):}\begin{equation*} \left(\frac{1}{r^{2}} \frac{d r}{d \phi}\right)^{2}+\frac{1-2 M / r}{r^{2}}=\frac{1}{b^{2}} \tag{25.55} \end{equation*}(25.55)(1r2drdϕ)2+12M/rr2=1b2
or
(25.56) ( 1 r 2 d r d ϕ ) 2 + B 2 ( r ) = b 2 (25.56) 1 r 2 d r d ϕ 2 + B 2 ( r ) = b 2 {:(25.56)((1)/(r^(2))(dr)/(d phi))^(2)+B^(-2)(r)=b^(-2):}\begin{equation*} \left(\frac{1}{r^{2}} \frac{d r}{d \phi}\right)^{2}+B^{-2}(r)=b^{-2} \tag{25.56} \end{equation*}(25.56)(1r2drdϕ)2+B2(r)=b2
or
(25.57) ( d u d ϕ ˙ ) 2 + u 2 ( 1 2 u ) = ( M b ) 2 1 b ~ 2 . (25.57) d u d ϕ ˙ 2 + u 2 ( 1 2 u ) = M b 2 1 b ~ 2 . {:(25.57)((du)/(d(phi^(˙))))^(2)+u^(2)(1-2u)=((M)/(b))^(2)-=(1)/( widetilde(b)^(2)).:}\begin{equation*} \left(\frac{d u}{d \dot{\phi}}\right)^{2}+u^{2}(1-2 u)=\left(\frac{M}{b}\right)^{2} \equiv \frac{1}{\widetilde{b}^{2}} . \tag{25.57} \end{equation*}(25.57)(dudϕ˙)2+u2(12u)=(Mb)21b~2.
Whichever way the differential equation for the orbit is written, one term in it depends on the choice of orbit (the term 1 / b 2 1 / b 2 1//b^(2)1 / b^{2}1/b2 ) the other on the properties of the Schwarzschild geometry, but not on the choice of orbit. This second term defines a kind of effective potential,
(25.58) ( "effective potential for photon" ) B 2 ( r ) 1 2 M / r r 2 (25.58)  "effective   potential for   photon"  B 2 ( r ) 1 2 M / r r 2 {:(25.58)([" "effective "],[" potential for "],[" photon" "])-=B^(-2)(r)-=(1-2M//r)/(r^(2)):}\left(\begin{array}{l} \text { "effective } \tag{25.58}\\ \text { potential for } \\ \text { photon" } \end{array}\right) \equiv B^{-2}(r) \equiv \frac{1-2 M / r}{r^{2}}(25.58)( "effective  potential for  photon" )B2(r)12M/rr2
No attempt is made here to take the square root, as was done for a particle of finite rest mass. There one took the root in order to have a quantity that reduced to the Newtonian effective potential (plus the rest mass) in the nonrelativistic limit; but for light ( v = 1 ) ( v = 1 ) (v=1)(v=1)(v=1) there is no nonrelativistic limit. Therefore the effective potential (25.58) is plotted directly in Box 25.7, and used there to analyze some of the principal features of the orbits of a photon in Schwarzschild geometry.
On occasion it has proved useful to plot as a function of r r rrr, not the "effective potential" of (25.58), but the "potential impact parameter B ( r ) B ( r ) B(r)B(r)B(r) " calculated from that formula [see, for example, Power and Wheeler (1957), Zel'dovich and Novikov (1971)]. This potential impact parameter has the following interpretation: A ray, in order to reach the point r r rrr, must have an impact parameter b b bbb that is equal to or less than B ( r ) B ( r ) B(r)B(r)B(r) :
(25.59) b B ( r ) ("condition of accessibility"). (25.59) b B ( r )  ("condition of accessibility").  {:(25.59)b <= B(r)" ("condition of accessibility"). ":}\begin{equation*} b \leq B(r) \text { ("condition of accessibility"). } \tag{25.59} \end{equation*}(25.59)bB(r) ("condition of accessibility"). 
A ray with zero impact parameter (head-on impact), or any impact parameter less than b crit = min [ B ( r ) ] = 3 3 M b crit  = min [ B ( r ) ] = 3 3 M b_("crit ")=min[B(r)]=3sqrt3Mb_{\text {crit }}=\min [B(r)]=3 \sqrt{3} Mbcrit =min[B(r)]=33M, can get to any and all r r rrr values.
The beautifully simple "effective potential" defined by (25.58) is used in (25.56) to determine the shape of an orbit; that is, the azimuth ϕ ϕ phi\phiϕ that the photon has when it gets to a given r r rrr-value. In other connections, it can be equally interesting to know when, or at what Schwarzschild coordinate time, the photon gets to a given r r rrr value. More broadly, the geodesic of a photon, for which proper time has no meaning, admits of analysis from first principles by way of an affine parameter λ λ lambda\lambdaλ, as contrasted with the device of first considering a particle and then going to the limit μ 0 μ 0 mu longrightarrow0\mu \longrightarrow 0μ0.
(4) critical impact parameter
(5) affine parameter

Box 25.7 QUALITATIVE ANALYSIS OF ORBITS OF A PHOTON

IN SCHWARZSCHILD GEOMETRY

A. Equations Governing Orbit

  1. Effective-potential equation for radial part of motion:
( d r d λ ) 2 + B 2 ( r ) = b 2 B 2 ( r ) = r 2 ( 1 2 M / r ) ; b = (impact parameter ) d r d λ 2 + B 2 ( r ) = b 2 B 2 ( r ) = r 2 ( 1 2 M / r ) ; b =  (impact parameter  ) {:[((dr)/(d lambda))^(2)+B^(-2)(r)=b^(-2)],[B^(-2)(r)=r^(-2)(1-2M//r);],[b=" (impact parameter ")]:}\begin{aligned} & \left(\frac{d r}{d \lambda}\right)^{2}+B^{-2}(r)=b^{-2} \\ & B^{-2}(r)=r^{-2}(1-2 M / r) ; \\ & b=\text { (impact parameter }) \end{aligned}(drdλ)2+B2(r)=b2B2(r)=r2(12M/r);b= (impact parameter )
  1. Supplementary equations to determine angular and time motion:
d ϕ / d λ = 1 / r 2 d t / d λ = b 1 ( 1 2 M / r ) 1 . d ϕ / d λ = 1 / r 2 d t / d λ = b 1 ( 1 2 M / r ) 1 . {:[d phi//d lambda=1//r^(2)],[dt//d lambda=b^(-1)(1-2M//r)^(-1).]:}\begin{gathered} d \phi / d \lambda=1 / r^{2} \\ d t / d \lambda=b^{-1}(1-2 M / r)^{-1} . \end{gathered}dϕ/dλ=1/r2dt/dλ=b1(12M/r)1.

B. Qualitative Features of Orbits (deduced from effective-potential diagram)
  1. A zero-mass particle with b > 3 3 M b > 3 3 M b > 3sqrt3Mb>3 \sqrt{3} Mb>33M, which falls in from r = r = r=oor=\inftyr=, is "reflected off the potential barrier" (periastron; b = B ; d r / d λ = 0 b = B ; d r / d λ = 0 b=B;dr//d lambda=0b=B ; d r / d \lambda=0b=B;dr/dλ=0 ) and returns to infinity.
    a. For b 3 3 M b 3 3 M b≫3sqrt3Mb \gg 3 \sqrt{3} Mb33M, the orbit is a straight line, except for a slight deflection of angle 4 M / b 4 M / b 4M//b4 M / b4M/b (exercise 25.21 ; $ 40.3 25.21 ; $ 40.3 25.21;$40.325.21 ; \$ 40.325.21;$40.3 ).
    b. For 0 < b 3 3 M M 0 < b 3 3 M M 0 < b-3sqrt3M≪M0<b-3 \sqrt{3} M \ll M0<b33MM, the particle circles the star many times ("unstable circular orbit) at r 3 M r 3 M r~~3Mr \approx 3 Mr3M before flying back to r = r = r=oor=\inftyr=.
  2. A zero-mass particle with b < 3 3 M b < 3 3 M b < 3sqrt3Mb<3 \sqrt{3} Mb<33M, which falls in from r = r = r=oor=\inftyr=, falls into r = 2 M r = 2 M r=2Mr=2 Mr=2M (no periastron).
  3. A zero-mass particle emitted from near r = 2 M r = 2 M r=2Mr=2 Mr=2M escapes to infinity only if it has b < 3 3 M b < 3 3 M b < 3sqrt3Mb<3 \sqrt{3} Mb<33M; otherwise it reaches an apastron and then gets pulled back into r = 2 M r = 2 M r=2Mr=2 Mr=2M.

C. Escape Versus Capture as a Function of Propagation Direction

An observer at rest in the Schwarzschild gravitational field measures the ordinary velocity of a zero-mass particle relative to his orthonormal frame [equations (23.15)]:
v r ^ = | g r r | 1 / 2 d r / d λ | g 00 | 1 / 2 d t / d λ = ± ( 1 b 2 / B 2 ) 1 / 2 ; v ϕ ^ = | g ϕ ϕ | 1 / 2 d ϕ / d λ | g 00 | 1 / 2 d t / d λ = b / B ; ( v r ^ ) 2 + ( v ϕ ^ ) 2 = 1 ; v r ^ = g r r 1 / 2 d r / d λ g 00 1 / 2 d t / d λ = ± 1 b 2 / B 2 1 / 2 ; v ϕ ^ = g ϕ ϕ 1 / 2 d ϕ / d λ g 00 1 / 2 d t / d λ = b / B ; v r ^ 2 + v ϕ ^ 2 = 1 ; {:[v_( hat(r))=(|g_(rr)|^(1//2)dr//d lambda)/(|g_(00)|^(1//2)dt//d lambda)=+-(1-b^(2)//B^(2))^(1//2);],[v_( hat(phi))=(|g_(phi phi)|^(1//2)d phi//d lambda)/(|g_(00)|^(1//2)dt//d lambda)=b//B;],[(v_( hat(r)))^(2)+(v_( hat(phi)))^(2)=1;]:}\begin{gathered} v_{\hat{r}}=\frac{\left|g_{r r}\right|^{1 / 2} d r / d \lambda}{\left|g_{00}\right|^{1 / 2} d t / d \lambda}= \pm\left(1-b^{2} / B^{2}\right)^{1 / 2} ; \\ v_{\hat{\phi}}=\frac{\left|g_{\phi \phi}\right|^{1 / 2} d \phi / d \lambda}{\left|g_{00}\right|^{1 / 2} d t / d \lambda}=b / B ; \\ \left(v_{\hat{r}}\right)^{2}+\left(v_{\hat{\phi}}\right)^{2}=1 ; \end{gathered}vr^=|grr|1/2dr/dλ|g00|1/2dt/dλ=±(1b2/B2)1/2;vϕ^=|gϕϕ|1/2dϕ/dλ|g00|1/2dt/dλ=b/B;(vr^)2+(vϕ^)2=1;
δ (angle between propagation direction and radial direction) = cos 1 v r ^ = sin 1 v ϕ ^ . δ  (angle between propagation direction and radial direction)  = cos 1 v r ^ = sin 1 v ϕ ^ . {:[delta-=" (angle between propagation direction and radial direction) "],[=cos^(-1)v_( hat(r))=sin^(-1)v_( hat(phi)).]:}\begin{aligned} \delta & \equiv \text { (angle between propagation direction and radial direction) } \\ & =\cos ^{-1} v_{\hat{r}}=\sin ^{-1} v_{\hat{\phi}} . \end{aligned}δ (angle between propagation direction and radial direction) =cos1vr^=sin1vϕ^.
To be able to cross over the potential barrier, the particle must have b < 3 3 M b < 3 3 M b < 3sqrt3Mb<3 \sqrt{3} Mb<33M, or v ϕ ^ 2 B 2 < 27 M 2 v ϕ ^ 2 B 2 < 27 M 2 v_( hat(phi))^(2)B^(2) < 27M^(2)v_{\hat{\phi}}{ }^{2} B^{2}<27 M^{2}vϕ^2B2<27M2, or sin 2 δ < 27 M 2 / B 2 sin 2 δ < 27 M 2 / B 2 sin^(2)delta < 27M^(2)//B^(2)\sin ^{2} \delta<27 M^{2} / B^{2}sin2δ<27M2/B2. This result, restated:
  1. A particle of zero rest mass at r < 3 M r < 3 M r < 3Mr<3 Mr<3M will eventually escape to infinity, rather than be captured by a black hole at r = 2 M r = 2 M r=2Mr=2 Mr=2M if and only if v r v r v_(r)v_{r}vr is positive and
sin δ < 3 3 M B 1 ( r ) sin δ < 3 3 M B 1 ( r ) sin delta < 3sqrt3MB^(-1)(r)\sin \delta<3 \sqrt{3} M B^{-1}(r)sinδ<33MB1(r)
  1. A particle of zero rest mass at r > 3 M r > 3 M r > 3Mr>3 Mr>3M will eventually escape to infinity if and only if: (1) v r v r v_(r)v_{r}vr is positive, or (2) v r v r v_(r)v_{r}vr is negative and
sin δ > 3 3 M B 1 ( r ) sin δ > 3 3 M B 1 ( r ) sin delta > 3sqrt3MB^(-1)(r)\sin \delta>3 \sqrt{3} M B^{-1}(r)sinδ>33MB1(r)
Return to the statement of the conservation laws (25.17) and (25.18) in the form that makes reference to the affine parameter λ λ lambda\lambdaλ but no reference to the rest mass μ μ mu\muμ; thus
(25.60) d ϕ d λ = L r 2 (25.60) d ϕ d λ = L r 2 {:(25.60)(d phi)/(d lambda)=(L)/(r^(2)):}\begin{equation*} \frac{d \phi}{d \lambda}=\frac{L}{r^{2}} \tag{25.60} \end{equation*}(25.60)dϕdλ=Lr2
and
(25.61) d t d λ = E 1 2 M / r (25.61) d t d λ = E 1 2 M / r {:(25.61)(dt)/(d lambda)=(E)/(1-2M//r):}\begin{equation*} \frac{d t}{d \lambda}=\frac{E}{1-2 M / r} \tag{25.61} \end{equation*}(25.61)dtdλ=E12M/r
Recall that the course of a photon in a gravitational field is governed by its direction but not by its energy. Therefore neither E E EEE nor L L LLL individually are relevant but only their ratio, the impact parameter b = L / E b = L / E b=L//Eb=L / Eb=L/E of (25.54) and exercise 25.14. This circumstance leads one to replace the affine parameter λ λ lambda\lambdaλ by a new affine parameter,
(25.62) λ new = L λ , (25.62) λ new  = L λ , {:(25.62)lambda_("new ")=L lambda",":}\begin{equation*} \lambda_{\text {new }}=L \lambda, \tag{25.62} \end{equation*}(25.62)λnew =Lλ,
that is equally constant along the world line of the photon. In this notation (drop the subscript "new" hereafter), the conservation laws take the form
(25.63) d ϕ d λ = 1 r 2 (25.64) d t d λ = 1 b ( 1 2 M / r ) (25.63) d ϕ d λ = 1 r 2 (25.64) d t d λ = 1 b ( 1 2 M / r ) {:[(25.63)(d phi)/(d lambda)=(1)/(r^(2))],[(25.64)(dt)/(d lambda)=(1)/(b(1-2M//r))]:}\begin{gather*} \frac{d \phi}{d \lambda}=\frac{1}{r^{2}} \tag{25.63}\\ \frac{d t}{d \lambda}=\frac{1}{b(1-2 M / r)} \tag{25.64} \end{gather*}(25.63)dϕdλ=1r2(25.64)dtdλ=1b(12M/r)
The statement that the world line of the photon is a line of zero lapse of proper time,
(25.65) g α β d x α d λ d x β d λ = 0 (25.65) g α β d x α d λ d x β d λ = 0 {:(25.65)g_(alpha beta)(dx^(alpha))/(d lambda)(dx^(beta))/(d lambda)=0:}\begin{equation*} g_{\alpha \beta} \frac{d x^{\alpha}}{d \lambda} \frac{d x^{\beta}}{d \lambda}=0 \tag{25.65} \end{equation*}(25.65)gαβdxαdλdxβdλ=0
leads to the "radial equation"
(25.66) ( d r d λ ) 2 + B 2 ( r ) = b 2 (25.66) d r d λ 2 + B 2 ( r ) = b 2 {:(25.66)((dr)/(d lambda))^(2)+B^(-2)(r)=b^(-2):}\begin{equation*} \left(\frac{d r}{d \lambda}\right)^{2}+B^{-2}(r)=b^{-2} \tag{25.66} \end{equation*}(25.66)(drdλ)2+B2(r)=b2
Here one encounters again the "effective potential" B 2 ( r ) B 2 ( r ) B^(-2)(r)B^{-2}(r)B2(r) of ( 25.58 ) ( 25.58 ) (25.58)(25.58)(25.58). The present fuller set of equations for the geodesic of a photon have the advantage that they reach beyond space to a description of the world line in spacetime.
Return to space! Figure 25.6 shows typical orbits for a photon in Schwarzschild geometry. Figure 25.7 shows angle of deflection as a function of impact parameter.
(7) scattering cross section From the information contained in this curve, one can evaluate the contributions to the differential scattering cross section
(25.67) d σ d Ω = "branches" | 2 π b d b 2 π sin Θ d Θ | (25.67) d σ d Ω = "branches"  2 π b d b 2 π sin Θ d Θ {:(25.67)(d sigma)/(d Omega)=sum_(""branches" ")|(2pi bdb)/(2pi sin Theta d Theta)|:}\begin{equation*} \frac{d \sigma}{d \Omega}=\sum_{\text {"branches" }}\left|\frac{2 \pi b d b}{2 \pi \sin \boldsymbol{\Theta} d \boldsymbol{\Theta}}\right| \tag{25.67} \end{equation*}(25.67)dσdΩ="branches" |2πbdb2πsinΘdΘ|
from the various "branches" of the scattering curve of Figure 25.7 [one turn around the center of attraction, two turns, etc.; for more on these branches and the central
Figure 25.6.
The orbit of a photon in the "equatorial plane" of a black hole, plotted in terms of the Schwarzschild coordinates r r rrr and ϕ ϕ phi\phiϕ, for selected values of the turning point of the orbit, r TP / M = 2.99 , 3.00 r TP / M = 2.99 , 3.00 r_(TP)//M=2.99,3.00r_{\mathrm{TP}} / M=2.99,3.00rTP/M=2.99,3.00 (unstable circular orbit), 3.01 , 3.5 , 4 , 5 , 6 , 7 , 8 , 9 3.01 , 3.5 , 4 , 5 , 6 , 7 , 8 , 9 3.01,3.5,4,5,6,7,8,93.01,3.5,4,5,6,7,8,93.01,3.5,4,5,6,7,8,9. The impact parameter is given by the formula b = r TP ( 1 2 M / r TP ) 1 / 2 b = r TP 1 2 M / r TP 1 / 2 b=r_(TP)(1-2M//r_(TP))^(-1//2)b=r_{\mathrm{TP}}\left(1-2 M / r_{\mathrm{TP}}\right)^{-1 / 2}b=rTP(12M/rTP)1/2. In none of the cases shown, even for the inward plunging spiral, is the impact parameter less than b crit = ( 27 ) 1 / 2 M b crit  = ( 27 ) 1 / 2 M b_("crit ")=(27)^(1//2)Mb_{\text {crit }}=(27)^{1 / 2} Mbcrit =(27)1/2M, nor are any of these orbits able to cross the circle r = 3 M r = 3 M r=3Mr=3 \mathrm{M}r=3M. That only happens for orbits with b b bbb less than b crit b crit  b_("crit ")b_{\text {crit }}bcrit . For such orbits there is no turning point; the photon comes in from infinity and ends up at r = 0 r = 0 r=0r=0r=0 : straight in for b = 0 b = 0 b=0b=0b=0 (head-on impact); only after many loops near r = 3 M r = 3 M r=3Mr=3 Mr=3M, when b / M = ( 27 ) 1 / 2 ε b / M = ( 27 ) 1 / 2 ε b//M=(27)^(1//2)-epsib / M=(27)^{1 / 2}-\varepsilonb/M=(27)1/2ε, where ε ε epsi\varepsilonε is a very small quantity. Appreciation is expressed to Prof. R. H. Dicke for permission to publish these curves, which he had a digital calculator compute and plot out directly from the formula d 2 u / d ϕ 2 = d 2 u / d ϕ 2 = d^(2)u//dphi^(2)=d^{2} u / d \phi^{2}=d2u/dϕ2= 3 u 2 u 3 u 2 u 3u^(2)-u3 u^{2}-u3u2u, where u = M / r u = M / r u=M//ru=M / ru=M/r.
role of the deflection function Θ = Θ ( b ) Θ = Θ ( b ) Theta=Theta(b)\Theta=\Theta(b)Θ=Θ(b) in the analysis of scattering, see, for example, Ford and Wheeler (1959a,b)]. For small angles the "Rutherford" part of the scattering predominates. The major part of the small-angle scattering, and in the limit Θ 0 Θ 0 Theta longrightarrow0\Theta \longrightarrow 0Θ0 all of it, comes from large impact parameters, for which one has
(25.68) Θ = 4 M b (25.68) Θ = 4 M b {:(25.68)Theta=(4M)/(b):}\begin{equation*} \Theta=\frac{4 M}{b} \tag{25.68} \end{equation*}(25.68)Θ=4Mb
(see exercises 25.21 and 25.24 ). It follows that the limiting form of the cross section is
(25.69) d σ d Ω = ( 4 M Θ 2 ) 2 ( small Θ ) (25.69) d σ d Ω = 4 M Θ 2 2 (  small  Θ ) {:(25.69)(d sigma)/(d Omega)=((4M)/(Theta^(2)))^(2)quad(" small "Theta):}\begin{equation*} \frac{d \sigma}{d \Omega}=\left(\frac{4 M}{\Theta^{2}}\right)^{2} \quad(\text { small } \Theta) \tag{25.69} \end{equation*}(25.69)dσdΩ=(4MΘ2)2( small Θ)
Also, at Θ = π Θ = π Theta=pi\Theta=\piΘ=π one has a singularity in the differential scattering cross section, with the character of a glory [see discussion following equation (25.44)]. Writing down the contributions of the several branches of the scattering function to the differential cross section, and summing them, one has, near Θ = π Θ = π Theta=pi\Theta=\piΘ=π,
(25.70) d σ d Ω = M 2 π Θ ( 1.75 + 0.0029 + 0.0000055 + ) = 1.75 M 2 π Θ . (25.70) d σ d Ω = M 2 π Θ ( 1.75 + 0.0029 + 0.0000055 + ) = 1.75 M 2 π Θ . {:(25.70)(d sigma)/(d Omega)=(M^(2))/(pi-Theta)(1.75+0.0029+0.0000055+cdots)=1.75(M^(2))/(pi-Theta).:}\begin{equation*} \frac{d \sigma}{d \Omega}=\frac{M^{2}}{\pi-\Theta}(1.75+0.0029+0.0000055+\cdots)=1.75 \frac{M^{2}}{\pi-\Theta} . \tag{25.70} \end{equation*}(25.70)dσdΩ=M2πΘ(1.75+0.0029+0.0000055+)=1.75M2πΘ.
Thus, in principle, if one shines a powerful source of light onto a black hole, one gets a direct return of a few photons from it. Equation (25.70) provides a means to calculate the strength of this return. See exercise 25.26.
Figure 25.7.
Deflection of a photon by a Schwarzschild black hole, or by any spherically symmetric center of attraction small enough not to block the trajectory of the photon. The accurate calculations (smooth curves) are compared with formulas (dashed curves) valid asymptotically in the two limiting cases of an impact parameter, b : b : b:b:b: (1) very close to b crit = 3 3 / 2 M b crit  = 3 3 / 2 M b_("crit ")=3^(3//2)Mb_{\text {crit }}=3^{3 / 2} Mbcrit =33/2M (many turns around the center of attraction); and (2) very large compared to b erit b erit  b_("erit ")b_{\text {erit }}berit  (small deflection). The algorithm for the accurate calculation of the deflection proceeds as follows (all distances being given, for simplicity, in units of the mass value, M M MMM ). (1) Choose a value, r = R r = R r=Rr=Rr=R, for the Schwarzschild coordinate of the point of closest approach. (2) Calculate the impact parameter, b b bbb, from b 2 = R 3 / ( R 2 ) b 2 = R 3 / ( R 2 ) b^(2)=R^(3)//(R-2)b^{2}=R^{3} /(R-2)b2=R3/(R2). (3) Calculate Q Q QQQ from Q 2 = ( R 2 ) ( R + 6 ) Q 2 = ( R 2 ) ( R + 6 ) Q^(2)=(R-2)(R+6)Q^{2}=(R-2)(R+6)Q2=(R2)(R+6). (4) Determine the modulus, k k kkk, of an "elliptic integral of the first kind" from sin 2 θ = k 2 = ( Q R + 6 ) / 2 Q sin 2 θ = k 2 = ( Q R + 6 ) / 2 Q sin^(2)theta=k^(2)=(Q-R+6)//2Q\sin ^{2} \theta=k^{2}=(Q-R+6) / 2 Qsin2θ=k2=(QR+6)/2Q. (5) Determine the so-called amplitude ϕ = ϕ min ϕ = ϕ min phi=phi_(min)\phi=\phi_{\min }ϕ=ϕmin of the same elliptic function from sn 2 u min = sin 2 ϕ ˙ min = sn 2 u min = sin 2 ϕ ˙ min = sn^(2)u_(min)=sin^(2)phi^(˙)_(min)=\operatorname{sn}^{2} u_{\min }=\sin ^{2} \dot{\phi}_{\min }=sn2umin=sin2ϕ˙min= ( 2 + Q R ) / ( 6 + Q R ) ( 2 + Q R ) / ( 6 + Q R ) (2+Q-R)//(6+Q-R)(2+Q-R) /(6+Q-R)(2+QR)/(6+QR). (6) Then the total deflection is
Θ = 4 ( R / Q ) 1 / 2 [ F ( π / 2 , θ ) F ( ϕ min , θ ) ] π Θ = 4 ( R / Q ) 1 / 2 F ( π / 2 , θ ) F ϕ min , θ π Theta=4(R//Q)^(1//2)[F(pi//2,theta)-F(phi_(min),theta)]-pi\Theta=4(R / Q)^{1 / 2}\left[F(\pi / 2, \theta)-F\left(\phi_{\min }, \theta\right)\right]-\piΘ=4(R/Q)1/2[F(π/2,θ)F(ϕmin,θ)]π
The values plotted here were kindly calculated by James A. Isenberg on the basis of the work of C. G. Darwin (1959, 1961).
(8) gravitational lens effect
When the source of illumination, instead of being on the observer's side of the black hole, is on the opposite side, then in addition to the "lens effect" experienced by photons flying by with large impact parameter [literature too vast to summarize here, but see, e.g., Refsdal (1964)], and subsumed in equation (25.68), there is a glory type of illumination (intensity 1 / sin Θ 1 / sin Θ ∼1//sin Theta\sim 1 / \sin \Theta1/sinΘ, with now, however, Θ Θ Theta\ThetaΘ close to zero) received from photons that have experienced deflections Θ = 2 π , 4 π , Θ = 2 π , 4 π , Theta=2pi,4pi,dots\Theta=2 \pi, 4 \pi, \ldotsΘ=2π,4π,. This illumination comes from "rings of brightness" located at impact parameters given by b / M 3 3 / 2 = 0.0065 , 0.000012 , b / M 3 3 / 2 = 0.0065 , 0.000012 , b//M-3^(3//2)=0.0065,0.000012,dotsb / M-3^{3 / 2}=0.0065,0.000012, \ldotsb/M33/2=0.0065,0.000012,. Interesting though all these optical effects are as matters of principle, they are, among all the ways to observe a black hole, the worst; see part VI, C, of Box 33.3 for a detailed discussion.

Exercise 25.23. QUALITATIVE FEATURES OF PHOTON ORBITS

EXERCISES

Verify all the statements about orbits for particles of zero rest mass made in Box 25.7 .

Exercise 25.24. LIGHT DEFLECTION

Using the dimensionless variable u = M / r u = M / r u=M//ru=M / ru=M/r in place of r r rrr itself, and u b = M / b u b = M / b u_(b)=M//bu_{b}=M / bub=M/b in place of the impact parameter, transform (25.55) into the first-order equation
(25.71) ( d u d ϕ ) 2 + ( 1 2 u ) u 2 = u b 2 (25.71) d u d ϕ 2 + ( 1 2 u ) u 2 = u b 2 {:(25.71)((du)/(d phi))^(2)+(1-2u)u^(2)=u_(b)^(2):}\begin{equation*} \left(\frac{d u}{d \phi}\right)^{2}+(1-2 u) u^{2}=u_{b}^{2} \tag{25.71} \end{equation*}(25.71)(dudϕ)2+(12u)u2=ub2
and thence, by differentiation, into
(25.72) d 2 u d ϕ 2 + u = 3 u 2 (25.72) d 2 u d ϕ 2 + u = 3 u 2 {:(25.72)(d^(2)u)/(dphi^(2))+u=3u^(2):}\begin{equation*} \frac{d^{2} u}{d \phi^{2}}+u=3 u^{2} \tag{25.72} \end{equation*}(25.72)d2udϕ2+u=3u2
(a) In the large-impact-parameter or small- u u uuu approximation, in which the term on the right is neglected, show that the solution of ( 25.72 ) ( 25.72 ) (25.72)(25.72)(25.72) yields elementary rectilinear motion (zero deflection).
(b) Insert this zero-order solution into the perturbation term 3 u 2 3 u 2 3u^(2)3 u^{2}3u2 on the righthand side of (25.72), and solve anew for u u uuu ("rectilinear motion plus first-order correction"). In this way, verify the formula for the bending of light by the sun given by putting β = 1 β = 1 beta=1\beta=1β=1 in equation (25.49).

Exercise 25.25. CAPTURE OF LIGHT BY A BLACK HOLE

Show that a Schwarzschild black hole presents a cross section σ capt = 27 π M 2 σ capt  = 27 π M 2 sigma_("capt ")=27 piM^(2)\sigma_{\text {capt }}=27 \pi M^{2}σcapt =27πM2 for capture of light.

Exercise 25.26. RETURN OF LIGHT FROM A BLACK HOLE

Show that flashing a powerful pulse of light onto a black hole leads in principle to a return from rings of brightness located at b / M 3 3 / 2 = 0.151 , 0.00028 , b / M 3 3 / 2 = 0.151 , 0.00028 , b//M-3^(3//2)=0.151,0.00028,dotsb / M-3^{3 / 2}=0.151,0.00028, \ldotsb/M33/2=0.151,0.00028,. How can one evaluate the difference in time delays of these distinct returns? Show that the intensity I I III of the return (erg / cm 2 / cm 2 //cm^(2)/ \mathrm{cm}^{2}/cm2 ) as a function of the energy E 0 ( erg / E 0 ( erg / E_(0)(erg//E_{0}(\mathrm{erg} /E0(erg/ steradian) of the original pulse, the mass M ( cm ) M ( cm ) M(cm)M(\mathrm{~cm})M( cm) of the black hole, the distance R R RRR to it, and the lateral distance r r rrr from the "flashlight" to the receptor of returned radiation is
I = E 0 R 3 r θ = ( 2 N + 1 ) π | 2 b d b d Θ | = E 0 M 2 R 3 r ( 1.75 + 0.0029 + 0.0000055 + ) I = E 0 R 3 r θ = ( 2 N + 1 ) π 2 b d b d Θ = E 0 M 2 R 3 r ( 1.75 + 0.0029 + 0.0000055 + ) I=(E_(0))/(R^(3)r)sum_(theta=(2N+1)pi)|(2bdb)/(d Theta)|=(E_(0)M^(2))/(R^(3)r)(1.75+0.0029+0.0000055+cdots)I=\frac{E_{0}}{R^{3} r} \sum_{\theta=(2 N+1) \pi}\left|\frac{2 b d b}{d \Theta}\right|=\frac{E_{0} M^{2}}{R^{3} r}(1.75+0.0029+0.0000055+\cdots)I=E0R3rθ=(2N+1)π|2bdbdΘ|=E0M2R3r(1.75+0.0029+0.0000055+)
under conditions where diffraction can be neglected.

§25.7. SPHERICAL STAR CLUSTERS

By combining orbit theory, as developed in this chapter, with kinetic theory in curved spacetime as developed in § 22.6 § 22.6 §22.6\S 22.6§22.6, one can formulate the theory of relativistic star clusters.
Consider, for simplicity, a spherically symmetric cluster of stars (e.g., a globular cluster, but one so dense that relativistic gravitational effects might be important).
(1) foundations for analysis
(2) solution of Vlasoff equation
Demand that the cluster be static, in the sense that the number density in phase space R R R\mathscr{R}R is independent of time. (New stars, flying along geodesic orbits, enter a fixed region in phase space at the same rate as "old" stars leave it.) Ignore collisions and close encounters between stars; i.e., treat each star's orbit as a geodesic in the spherically symmetric spacetime of the cluster as a whole.
With these idealizations accepted, one can write down a manageable set of equations for the structure of the cluster.* Since the cluster is static and spherical, so must be its gravitational field. Consequently, one can introduce the same kind of coordinate system ("Schwarzschild coordinates") as was used for a static spherical star in Chapter 23:
(25.73) d s 2 = e 2 ϕ d t 2 + e 2 Λ d r 2 + r 2 d Ω 2 ; Φ = Φ ( r ) , Λ = Λ ( r ) (25.73) d s 2 = e 2 ϕ d t 2 + e 2 Λ d r 2 + r 2 d Ω 2 ; Φ = Φ ( r ) , Λ = Λ ( r ) {:(25.73)ds^(2)=-e^(2phi)dt^(2)+e^(2Lambda)dr^(2)+r^(2)dOmega^(2);quad Phi=Phi(r)","quad Lambda=Lambda(r):}\begin{equation*} d s^{2}=-e^{2 \phi} d t^{2}+e^{2 \Lambda} d r^{2}+r^{2} d \Omega^{2} ; \quad \Phi=\Phi(r), \quad \Lambda=\Lambda(r) \tag{25.73} \end{equation*}(25.73)ds2=e2ϕdt2+e2Λdr2+r2dΩ2;Φ=Φ(r),Λ=Λ(r)
In the tangent space at each event in spacetime reside the momentum vectors of the swarming stars. For coordinates in this tangent space ("momentum space"), it is convenient to use the physical components of 4 -momentum, p α ^ p α ^ p^( hat(alpha))p^{\hat{\alpha}}pα^-i.e., components on the orthonormal frame
(25.74) ω t ^ = e ϕ d t , ω r ^ = e Λ d r , ω θ ^ = r d θ , ω ϕ ^ = r sin θ d ϕ (25.74) ω t ^ = e ϕ d t , ω r ^ = e Λ d r , ω θ ^ = r d θ , ω ϕ ^ = r sin θ d ϕ {:(25.74)omega^( hat(t))=e^(phi)dt","quadomega^( hat(r))=e^(Lambda)dr","quadomega^( hat(theta))=rd theta","quadomega^( hat(phi))=r sin theta d phi:}\begin{equation*} \boldsymbol{\omega}^{\hat{t}}=e^{\phi} \boldsymbol{d} t, \quad \boldsymbol{\omega}^{\hat{r}}=e^{\Lambda} \boldsymbol{d} r, \quad \boldsymbol{\omega}^{\hat{\theta}}=r \boldsymbol{d} \theta, \quad \boldsymbol{\omega}^{\hat{\phi}}=r \sin \theta \boldsymbol{d} \phi \tag{25.74} \end{equation*}(25.74)ωt^=eϕdt,ωr^=eΛdr,ωθ^=rdθ,ωϕ^=rsinθdϕ
Then the number density of stars in phase space is a spherically symmetric, static function
(25.75) R = R [ r , p 0 ^ , p r ^ , ( p θ ^ 2 + p ϕ ^ 2 ) 1 / 2 ] . (25.75) R = R r , p 0 ^ , p r ^ , p θ ^ 2 + p ϕ ^ 2 1 / 2 . {:(25.75)R=R[r,p^( hat(0)),p^( hat(r)),(p^( hat(theta)2)+p^( hat(phi)2))^(1//2)].:}\begin{equation*} \mathscr{R}=\mathscr{R}\left[r, p^{\hat{0}}, p^{\hat{r}},\left(p^{\hat{\theta} 2}+p^{\hat{\phi} 2}\right)^{1 / 2}\right] . \tag{25.75} \end{equation*}(25.75)R=R[r,p0^,pr^,(pθ^2+pϕ^2)1/2].
[ :'\because is independent of t t ttt because the cluster is static; and independent of θ , ϕ θ , ϕ theta,phi\theta, \phiθ,ϕ, and angle Θ = tan 1 ( p ϕ ^ / p θ ^ ) Θ = tan 1 p ϕ ^ / p θ ^ Theta=tan^(-1)(p^( hat(phi))//p^( hat(theta)))\Theta=\tan ^{-1}\left(p^{\hat{\phi}} / p^{\hat{\theta}}\right)Θ=tan1(pϕ^/pθ^) because of spherical symmetry.]
The functions describing the structure of the cluster, Φ , Λ Φ , Λ Phi,Lambda\Phi, \LambdaΦ,Λ, and π π pi\mathscr{\pi}π, are determined by the kinetic (also, in this context, called the Vlasoff) equation ( $ 22.6 $ 22.6 $22.6\$ 22.6$22.6 )
(25.76a) d R / d λ = 0 , i.e., R conserved along orbit of each star in phase space; (25.76a) d R / d λ = 0 ,  i.e.,  R  conserved along orbit   of each star in phase space;  {:(25.76a){:[dR//d lambda=0","" i.e., "R" conserved along orbit "],[" of each star in phase space; "]:}:}\begin{array}{r} d \mathscr{R} / d \lambda=0, \text { i.e., } \mathscr{R} \text { conserved along orbit } \tag{25.76a}\\ \text { of each star in phase space; } \end{array}(25.76a)dR/dλ=0, i.e., R conserved along orbit  of each star in phase space; 
and by the Einstein field equations
(25.76b) G α ^ β ^ = 8 π T α ^ β ^ = 8 π ( ϰ p α ^ p β ^ ) μ 1 d p o ^ d p r ^ d p θ ^ d p ϕ ^ (25.76b) G α ^ β ^ = 8 π T α ^ β ^ = 8 π ϰ p α ^ p β ^ μ 1 d p o ^ d p r ^ d p θ ^ d p ϕ ^ {:(25.76b)G^( hat(alpha) hat(beta))=8piT^( hat(alpha) hat(beta))=8pi int(ϰp^( hat(alpha))p^( hat(beta)))mu^(-1)dp^( hat(o))dp^( hat(r))dp^( hat(theta))dp^( hat(phi)):}\begin{equation*} G^{\hat{\alpha} \hat{\beta}}=8 \pi T^{\hat{\alpha} \hat{\beta}}=8 \pi \int\left(\varkappa p^{\hat{\alpha}} p^{\hat{\beta}}\right) \mu^{-1} d p^{\hat{o}} d p^{\hat{r}} d p^{\hat{\theta}} d p^{\hat{\phi}} \tag{25.76b} \end{equation*}(25.76b)Gα^β^=8πTα^β^=8π(ϰpα^pβ^)μ1dpo^dpr^dpθ^dpϕ^
[The Vlasoff equation for Newtonian star clusters is treated by Ogorodnikov (1965). The above expression for the stress-energy tensor of a swarm of particles (stars) was derived in exercise 22.18. Here, as in exercise 22.18, the particles (stars) are assumed not all to have the same rest mass. Note that rest mass is here denoted μ μ mu\muμ, but in Chapter 22 it was denoted m m mmm.]
To solve the Vlasoff equation, one need only note that R R R\mathscr{R}R is conserved along stellar orbits and therefore must be a function of the constants of the orbital motion. There is a constant of motion corresponding to each Killing vector in the cluster's static, spherical spacetime (see exercise 25.8 ):
E = "energy at infinity" = p ( / t ) = p 0 , E =  "energy at infinity"  = p ( / t ) = p 0 , E=" "energy at infinity" "=-p*(del//del t)=-p_(0),E=\text { "energy at infinity" }=-\boldsymbol{p} \cdot(\partial / \partial t)=-p_{0},E= "energy at infinity" =p(/t)=p0,
L z = " z -component of angular momentum" = p ξ z = p ( / ϕ ) = p ϕ L z = " z -component of angular momentum"  = p ξ z = p ( / ϕ ) = p ϕ L_(z)="z"-component of angular momentum" "=p*xi_(z)=p*(del//del phi)=p_(phi)L_{z}=" z \text {-component of angular momentum" }=p \cdot \xi_{z}=p \cdot(\partial / \partial \phi)=p_{\phi}Lz="z-component of angular momentum" =pξz=p(/ϕ)=pϕ
$$
(25.77a) L y = " y -component of angular momentum" = p ξ y , (25.77a) L y = " y -component of angular momentum"  = p ξ y , {:(25.77a)L_(y)="y"-component of angular momentum" "=p*xi_(y)",":}\begin{equation*} L_{y}=" y \text {-component of angular momentum" }=p \cdot \xi_{y}, \tag{25.77a} \end{equation*}(25.77a)Ly="y-component of angular momentum" =pξy,
$ L x = $ " $ x $ c o m p o n e n t o f a n g u l a r m o m e n t u m " $ = p ξ x $ . I n a d d i t i o n , e a c h s t a r s r e s t m a s s $ L x = $ " $ x $ c o m p o n e n t o f a n g u l a r m o m e n t u m " $ = p ξ x $ . I n a d d i t i o n , e a c h s t a r s r e s t m a s s $L_(x)=$"$x$-componentofangularmomentum"$=p*xi_(x)$.Inaddition,eachstar^(')srestmass$L_{x}=$ " $x$-component of angular momentum" $=p \cdot \xi_{x}$. In addition, each star's rest mass$Lx=$"$x$componentofangularmomentum"$=pξx$.Inaddition,eachstarsrestmass
(25.77b) μ = ( p 0 ^ 2 p r ^ 2 p θ ^ 2 p ϕ ^ 2 ) 1 / 2 (25.77b) μ = p 0 ^ 2 p r ^ 2 p θ ^ 2 p ϕ ^ 2 1 / 2 {:(25.77b)mu=(p^( hat(0)2)-p^( hat(r)2)-p^( hat(theta)2)-p^( hat(phi)2))^(1//2):}\begin{equation*} \mu=\left(p^{\hat{0} 2}-p^{\hat{r} 2}-p^{\hat{\theta} 2}-p^{\hat{\phi} 2}\right)^{1 / 2} \tag{25.77b} \end{equation*}(25.77b)μ=(p0^2pr^2pθ^2pϕ^2)1/2
i s a c o n s t a n t o f i t s m o t i o n . T h e g e n e r a l s o l u t i o n o f t h e V l a s o f f e q u a t i o n , t h e n , h a s t h e f o r m i s a c o n s t a n t o f i t s m o t i o n . T h e g e n e r a l s o l u t i o n o f t h e V l a s o f f e q u a t i o n , t h e n , h a s t h e f o r m isaconstantofitsmotion.ThegeneralsolutionoftheVlasoffequation,then,hastheformis a constant of its motion. The general solution of the Vlasoff equation, then, has the formisaconstantofitsmotion.ThegeneralsolutionoftheVlasoffequation,then,hastheform
\mathscr{T}=H\left(E, L_{x}, L_{y}, L_{z}, \mu\right)
$$
But this general solution is not spherically symmetric. For example, the distribution function
= H ( E , μ , L z ) δ ( L y ) δ ( L x ) , = H E , μ , L z δ L y δ L x , ℜ=H(E,mu,L_(z))delta(L_(y))delta(L_(x)),\mathscr{\Re}=H\left(E, \mu, L_{z}\right) \delta\left(L_{y}\right) \delta\left(L_{x}\right),=H(E,μ,Lz)δ(Ly)δ(Lx),
corresponds to a cluster of stars with orbits all in the equatorial plane θ = θ = theta=\theta=θ= π / 2 π / 2 pi//2\pi / 2π/2 ( L y = L x = 0 L y = L x = 0 L_(y)=L_(x)=0L_{y}=L_{x}=0Ly=Lx=0 for all stars in cluster). To be spherical the cluster's distribution function must depend only on the magnitude
L = ( L x 2 + L y 2 + L z 2 ) 1 / 2 L = L x 2 + L y 2 + L z 2 1 / 2 L=(L_(x)^(2)+L_(y)^(2)+L_(z)^(2))^(1//2)L=\left(L_{x}{ }^{2}+L_{y}{ }^{2}+L_{z}^{2}\right)^{1 / 2}L=(Lx2+Ly2+Lz2)1/2
of the angular momentum, and not on its direction (not on the orientation of a star's orbital plane). Thus, the general spherical solution to the Vlasoff equation in a static, spherical spacetime must have the form
(25.78) r = F ( E , L , μ ) . (25.78) r = F ( E , L , μ ) . {:(25.78)r=F(E","L","mu).:}\begin{equation*} \mathscr{r}=F(E, L, \mu) . \tag{25.78} \end{equation*}(25.78)r=F(E,L,μ).
To use this general solution, one must reexpress the constants of the motion E E EEE, L , μ L , μ L,muL, \muL,μ, in terms of the agreed-on phase-space coordinates ( t , r , θ , ϕ , p 0 ^ , p γ ^ , p θ ^ , p ϕ ^ ) t , r , θ , ϕ , p 0 ^ , p γ ^ , p θ ^ , p ϕ ^ (t,r,theta,phi,p^( hat(0)),p^( hat(gamma)),p^( hat(theta)),p^( hat(phi)))\left(t, r, \theta, \phi, p^{\hat{0}}, p^{\hat{\gamma}}, p^{\hat{\theta}}, p^{\hat{\phi}}\right)(t,r,θ,ϕ,p0^,pγ^,pθ^,pϕ^). The rest mass of a star is given by (25.77b). The energy-at-infinity is obtained by redshifting the locally measured energy
(25.79a) E = p 0 = e ϕ p 0 ^ (25.79a) E = p 0 = e ϕ p 0 ^ {:(25.79a)E=-p_(0)=e^(phi)p^( hat(0)):}\begin{equation*} E=-p_{0}=e^{\phi} p^{\hat{0}} \tag{25.79a} \end{equation*}(25.79a)E=p0=eϕp0^
For an orbit in the equatorial plane ( p θ = p θ = p θ ^ = 0 ; L x = L y = 0 ) p θ = p θ = p θ ^ = 0 ; L x = L y = 0 (p_(theta)=p^(theta)=p^( hat(theta))=0;L_(x)=L_(y)=0)\left(p_{\theta}=p^{\theta}=p^{\hat{\theta}}=0 ; L_{x}=L_{y}=0\right)(pθ=pθ=pθ^=0;Lx=Ly=0), the total angular momentum has the form
L = | L z | = | p ϕ | = | r p ϕ ^ | = r × ("tangential" component of 4-momentum). L = L z = p ϕ = r p ϕ ^ = r ×  ("tangential" component of 4-momentum).  L=|L_(z)|=|p_(phi)|=|rp^( hat(phi))|=r xx" ("tangential" component of 4-momentum). "L=\left|L_{z}\right|=\left|p_{\phi}\right|=\left|r p^{\hat{\phi}}\right|=r \times \text { ("tangential" component of 4-momentum). }L=|Lz|=|pϕ|=|rpϕ^|=r× ("tangential" component of 4-momentum). 
By symmetry, the equation L = r × L = r × L=r xxL=r \timesL=r× ("tangential" component of p p p\boldsymbol{p}p ) must hold true also for orbits in other planes; it must be perfectly general:
(25.79b) L = r p r ^ , (25.79b) L = r p r ^ , {:(25.79b)L=rp^( hat(r))",":}\begin{equation*} L=r p^{\hat{r}}, \tag{25.79b} \end{equation*}(25.79b)L=rpr^,
p r ^ ( p r ^ ( p^( hat(r))-=(p^{\hat{r}} \equiv(pr^( tangential component of 4-momentum ) = [ ( p θ ^ ) 2 + ( p ϕ ^ ) 2 ] 1 / 2 ) = p θ ^ 2 + p ϕ ^ 2 1 / 2 )=[(p^( hat(theta)))^(2)+(p^( hat(phi)))^(2)]^(1//2))=\left[\left(p^{\hat{\theta}}\right)^{2}+\left(p^{\hat{\phi}}\right)^{2}\right]^{1 / 2})=[(pθ^)2+(pϕ^)2]1/2
(see exercise 25.9).
(3) "smeared-out" stress-energy tensor due to stars
Before solving the Einstein field equations, one finds it useful to reduce the stress-energy tensor to a more explicit form than (25.76b). The off-diagonal components T 0 ^ j ^ T 0 ^ j ^ T^( hat(0) hat(j))T^{\hat{0} \hat{j}}T0^j^ and T j ^ k ^ ( j k ) T j ^ k ^ ( j k ) T^( hat(j) hat(k))(j!=k)T^{\hat{j} \hat{k}}(j \neq k)Tj^k^(jk) all vanish because their integrands are odd functions of p i ^ p i ^ p^( hat(i))p^{\hat{i}}pi^. The integrands for the diagonal components T 0 ^ 0 ^ , T r ^ T 0 ^ 0 ^ , T r ^ T^( hat(0) hat(0)),T^( hat(r))T^{\hat{0} \hat{0}}, T^{\hat{r}}T0^0^,Tr^, and 1 2 ( T θ ^ θ ^ + T ϕ ^ ϕ ^ ) 1 2 T θ ^ θ ^ + T ϕ ^ ϕ ^ (1)/(2)(T^( hat(theta) hat(theta))+T^( hat(phi) hat(phi)))\frac{1}{2}\left(T^{\hat{\theta} \hat{\theta}}+T^{\hat{\phi} \hat{\phi}}\right)12(Tθ^θ^+Tϕ^ϕ^) are independent of angle Θ tan 1 ( p ϕ ^ / p θ ^ ) Θ tan 1 p ϕ ^ / p θ ^ Theta-=tan^(-1)(p^( hat(phi))//p^( hat(theta)))\Theta \equiv \tan ^{-1}\left(p^{\hat{\phi}} / p^{\hat{\theta}}\right)Θtan1(pϕ^/pθ^) in the tangential momentum plane; so the momentum volume element can be rewritten as
d p 0 ^ d p r ^ d p θ ^ d p ϕ ^ 2 π p r ^ d p r ^ d p r ^ d p 0 ^ d p 0 ^ d p r ^ d p θ ^ d p ϕ ^ 2 π p r ^ d p r ^ d p r ^ d p 0 ^ dp^( hat(0))dp^( hat(r))dp^( hat(theta))dp^( hat(phi))longrightarrow2pip^( hat(r))dp^( hat(r))dp^( hat(r))dp^( hat(0))d p^{\hat{0}} d p^{\hat{r}} d p^{\hat{\theta}} d p^{\hat{\phi}} \longrightarrow 2 \pi p^{\hat{r}} d p^{\hat{r}} d p^{\hat{r}} d p^{\hat{0}}dp0^dpr^dpθ^dpϕ^2πpr^dpr^dpr^dp0^
Changing variables from ( p r ^ , p r ^ , p γ ^ p r ^ , p r ^ , p γ ^ p^( hat(r)),p^( hat(r)),p^( hat(gamma))p^{\hat{r}}, p^{\hat{r}}, p^{\hat{\gamma}}pr^,pr^,pγ^ ) to ( p r ^ , μ , p 0 ^ ) p r ^ , μ , p 0 ^ (p^( hat(r)),mu,p^( hat(0)))\left(p^{\hat{r}}, \mu, p^{\hat{0}}\right)(pr^,μ,p0^) where
μ = [ ( p θ ^ ) 2 ( p ı ^ ) 2 ( p T ^ ) 2 ] 1 / 2 , μ = p θ ^ 2 p ı ^ 2 p T ^ 2 1 / 2 , mu=[(p^( hat(theta)))^(2)-(p^( hat(ı)))^(2)-(p^( hat(T)))^(2)]^(1//2),\mu=\left[\left(p^{\hat{\theta}}\right)^{2}-\left(p^{\hat{\imath}}\right)^{2}-\left(p^{\hat{T}}\right)^{2}\right]^{1 / 2},μ=[(pθ^)2(pı^)2(pT^)2]1/2,
and recognizing that two values of p r ^ ( ± p r ^ ) p r ^ ± p r ^ p^( hat(r))(+-p^( hat(r)))p^{\hat{r}}\left( \pm p^{\hat{r}}\right)pr^(±pr^) correspond to each value of μ μ mu\muμ, one brings the volume element into the form
2 π p r ^ d p r ^ d p r ^ d p 0 ^ 4 π ( p r ^ μ / p r ^ ) d p r ^ d p 0 ^ d μ 2 π p r ^ d p r ^ d p r ^ d p 0 ^ 4 π p r ^ μ / p r ^ d p r ^ d p 0 ^ d μ 2pip^( hat(r))dp^( hat(r))dp^( hat(r))dp^( hat(0))longrightarrow4pi(p^( hat(r))mu//p^( hat(r)))dp^( hat(r))dp^( hat(0))d mu2 \pi p^{\hat{r}} d p^{\hat{r}} d p^{\hat{r}} d p^{\hat{0}} \longrightarrow 4 \pi\left(p^{\hat{r}} \mu / p^{\hat{r}}\right) d p^{\hat{r}} d p^{\hat{0}} d \mu2πpr^dpr^dpr^dp0^4π(pr^μ/pr^)dpr^dp0^dμ
The diagonal components of T T T\boldsymbol{T}T [equation (25.76b)] then read
(25.81a) ρ T 0 ^ 0 ^ = ( total density of mass-energy ) = 4 π F ( e ϕ p 0 ^ , r p r ^ , μ ) ( p 0 ^ 2 p r ^ / p r ^ ) d p r ^ d p 0 ^ d μ , (25.81b) P T 1 2 ( T θ ^ ^ θ ^ + T ϕ ^ ϕ ^ ) = T θ ^ ^ θ ^ = T ϕ ^ ϕ ^ = ( tangential pressure ) = 2 π F ( e ϕ p 0 ^ , r p r ^ , μ ) [ ( p r ^ ) 3 / p r ^ ] d p r ^ d p o ^ d μ , P r T r ^ r ^ = ( radial pressure ) (25.81c) = 4 π F ( e ϕ p 0 ^ , r p r ^ , μ ) ( p γ ^ p r ^ ) d p r ^ d p 0 ^ d μ . (25.81a) ρ T 0 ^ 0 ^ = (  total density of mass-energy  ) = 4 π F e ϕ p 0 ^ , r p r ^ , μ p 0 ^ 2 p r ^ / p r ^ d p r ^ d p 0 ^ d μ , (25.81b) P T 1 2 T θ ^ ^ θ ^ + T ϕ ^ ϕ ^ = T θ ^ ^ θ ^ = T ϕ ^ ϕ ^ = (  tangential pressure  ) = 2 π F e ϕ p 0 ^ , r p r ^ , μ p r ^ 3 / p r ^ d p r ^ d p o ^ d μ , P r T r ^ r ^ = (  radial pressure  ) (25.81c) = 4 π F e ϕ p 0 ^ , r p r ^ , μ p γ ^ p r ^ d p r ^ d p 0 ^ d μ . {:[(25.81a)rho-=T^( hat(0) hat(0))=(" total density of mass-energy ")],[=4pi int F(e^(phi)p^( hat(0)),rp^( hat(r)),mu)(p^( hat(0)2)p^( hat(r))//p^( hat(r)))dp^( hat(r))dp^( hat(0))d mu","],[(25.81b)P_(T)-=(1)/(2)(T^( hat(hat(theta)) hat(theta))+T^( hat(phi) hat(phi)))=T^( hat(hat(theta)) hat(theta))=T^( hat(phi) hat(phi))=(" tangential pressure ")],[=2pi int F(e^(phi)p^( hat(0)),rp^( hat(r)),mu)[(p^( hat(r)))^(3)//p^( hat(r))]dp^( hat(r))dp^( hat(o))d mu","],[P_(r)-=T^( hat(r) hat(r))=(" radial pressure ")],[(25.81c)=4pi int F(e^(phi)p^( hat(0)),rp^( hat(r)),mu)(p^( hat(gamma))p^( hat(r)))dp^( hat(r))dp^( hat(0))d mu.]:}\begin{align*} & \rho \equiv T^{\hat{0} \hat{0}}=(\text { total density of mass-energy }) \tag{25.81a}\\ &=4 \pi \int F\left(e^{\phi} p^{\hat{0}}, r p^{\hat{r}}, \mu\right)\left(p^{\hat{0} 2} p^{\hat{r}} / p^{\hat{r}}\right) d p^{\hat{r}} d p^{\hat{0}} d \mu, \\ & P_{T} \equiv \frac{1}{2}\left(T^{\hat{\hat{\theta}} \hat{\theta}}+T^{\hat{\phi} \hat{\phi}}\right)=T^{\hat{\hat{\theta}} \hat{\theta}}=T^{\hat{\phi} \hat{\phi}}=(\text { tangential pressure }) \tag{25.81b}\\ &=2 \pi \int F\left(e^{\phi} p^{\hat{0}}, r p^{\hat{r}}, \mu\right)\left[\left(p^{\hat{r}}\right)^{3} / p^{\hat{r}}\right] d p^{\hat{r}} d p^{\hat{o}} d \mu, \\ & P_{r} \equiv T^{\hat{r} \hat{r}}=(\text { radial pressure }) \\ &=4 \pi \int F\left(e^{\phi} p^{\hat{0}}, r p^{\hat{r}}, \mu\right)\left(p^{\hat{\gamma}} p^{\hat{r}}\right) d p^{\hat{r}} d p^{\hat{0}} d \mu . \tag{25.81c} \end{align*}(25.81a)ρT0^0^=( total density of mass-energy )=4πF(eϕp0^,rpr^,μ)(p0^2pr^/pr^)dpr^dp0^dμ,(25.81b)PT12(Tθ^^θ^+Tϕ^ϕ^)=Tθ^^θ^=Tϕ^ϕ^=( tangential pressure )=2πF(eϕp0^,rpr^,μ)[(pr^)3/pr^]dpr^dpo^dμ,PrTr^r^=( radial pressure )(25.81c)=4πF(eϕp0^,rpr^,μ)(pγ^pr^)dpr^dp0^dμ.
When performing these integrals, one must express p γ ^ p γ ^ p^( hat(gamma))p^{\hat{\gamma}}pγ^ in terms of the variables of integration,
(25.81d) p r ^ = [ ( p ^ ^ ) 2 ( p t ^ ) 2 μ 2 ] 1 / 2 (25.81d) p r ^ = p ^ ^ 2 p t ^ 2 μ 2 1 / 2 {:(25.81d)p^( hat(r))=[(p^( hat(hat())))^(2)-(p^( hat(t)))^(2)-mu^(2)]^(1//2):}\begin{equation*} p^{\hat{r}}=\left[\left(p^{\hat{\hat{}}}\right)^{2}-\left(p^{\hat{t}}\right)^{2}-\mu^{2}\right]^{1 / 2} \tag{25.81d} \end{equation*}(25.81d)pr^=[(p^^)2(pt^)2μ2]1/2
The Einstein field equations for this stress-energy tensor and the metric (25.73), after use of expressions (14.43) for G α ^ β ^ G α ^ β ^ G^( hat(alpha) hat(beta))G^{\hat{\alpha} \hat{\beta}}Gα^β^ and after manipulations analogous to those for a spherical star ( $ 23.5 $ 23.5 $23.5\$ 23.5$23.5 ), reduce to
(25.82a) e 2 Λ = ( 1 2 m / r ) 1 , m = 0 r 4 π r 2 ρ d r ; (25.82b) d Φ d r = m + 4 π r 3 P r r ( r 2 m ) . (25.82a) e 2 Λ = ( 1 2 m / r ) 1 , m = 0 r 4 π r 2 ρ d r ; (25.82b) d Φ d r = m + 4 π r 3 P r r ( r 2 m ) . {:[(25.82a)e^(2Lambda)=(1-2m//r)^(-1)","quad m=int_(0)^(r)4pir^(2)rho dr;],[(25.82b)(d Phi)/(dr)=(m+4pir^(3)P_(r))/(r(r-2m)).]:}\begin{gather*} e^{2 \Lambda}=(1-2 m / r)^{-1}, \quad m=\int_{0}^{r} 4 \pi r^{2} \rho d r ; \tag{25.82a}\\ \frac{d \Phi}{d r}=\frac{m+4 \pi r^{3} P_{r}}{r(r-2 m)} . \tag{25.82b} \end{gather*}(25.82a)e2Λ=(12m/r)1,m=0r4πr2ρdr;(25.82b)dΦdr=m+4πr3Prr(r2m).
These equations, together with the assumed form F ( E , L , μ ) F ( E , L , μ ) F(E,L,mu)F(E, L, \mu)F(E,L,μ) of the distribution
function and the integrals (25.81) for ρ , P r ρ , P r rho,P_(r)\rho, P_{r}ρ,Pr, and P T P T P_(T)P_{T}PT, determine the structure of the cluster. Box 25.8 gives an overview of these structure equations, and specializes them for an isotropic velocity distribution. Box 25.9 presents and discusses the solution to the equations for an isothermal star cluster (truncated Maxwellian velocity distribution).

Exercise 25.27. ISOTROPIC STAR CLUSTER

EXERCISES

For a cluster with distribution function independent of angular momentum, derive properties B. 1 to B. 6 of Box 25.8 .
Exercise 25.28. SELF-SIMILAR CLUSTER [See Bisnovatyi-Kogan and Zel'dovich (1969), Bisnovatyi-Kogan and Thorne (1970).]
(a) Find a solution to the equations of structure for a spherical star of infinite central density, with the equation of state P = γ ρ P = γ ρ P=gamma rhoP=\gamma \rhoP=γρ, where γ γ gamma\gammaγ is a constant ( 0 < γ < 1 / 3 ) ( 0 < γ < 1 / 3 ) (0 < gamma < 1//3)(0<\gamma<1 / 3)(0<γ<1/3).
(b) Find an isotropic distribution function F ( E , μ ) F ( E , μ ) F(E,mu)F(E, \mu)F(E,μ) that leads to a star cluster with the same distributions of ρ , P , m ρ , P , m rho,P,m\rho, P, mρ,P,m, and Φ Φ Phi\PhiΦ as in the gas sphere of part (a). (See Box 25.8.) [Answer:
P = γ ρ = γ 2 1 + 6 γ + γ 2 1 2 π r 2 , e 2 A = ( 1 2 m / r ) 1 = ( 1 + 6 γ + γ 2 ) / ( 1 + γ ) 2 , e 2 Φ = B r 4 γ / ( 1 + γ ) , B = const; F = A ( E / B 1 / 2 ) ( 1 + γ ) / γ δ ( μ μ 0 ) = A r 2 ( E local ) ( 1 + γ ) / γ , A = const. ] P = γ ρ = γ 2 1 + 6 γ + γ 2 1 2 π r 2 , e 2 A = ( 1 2 m / r ) 1 = 1 + 6 γ + γ 2 / ( 1 + γ ) 2 , e 2 Φ = B r 4 γ / ( 1 + γ ) , B =  const;  F = A E / B 1 / 2 ( 1 + γ ) / γ δ μ μ 0 = A r 2 E local  ( 1 + γ ) / γ , A =  const.  {:[P=gamma rho=(gamma^(2))/(1+6gamma+gamma^(2))(1)/(2pir^(2))","],[e^(2A)=(1-2m//r)^(-1)=(1+6gamma+gamma^(2))//(1+gamma)^(2)","],[e^(2Phi)=Br^(4gamma//(1+gamma))","quad B=" const; "],[{:F=A(E//B^(1//2))^(-(1+gamma)//gamma)delta(mu-mu_(0))=Ar^(-2)(E_("local "))^(-(1+gamma)//gamma),quad A=" const. "]]:}\begin{gathered} P=\gamma \rho=\frac{\gamma^{2}}{1+6 \gamma+\gamma^{2}} \frac{1}{2 \pi r^{2}}, \\ e^{2 A}=(1-2 m / r)^{-1}=\left(1+6 \gamma+\gamma^{2}\right) /(1+\gamma)^{2}, \\ e^{2 \Phi}=B r^{4 \gamma /(1+\gamma)}, \quad B=\text { const; } \\ \left.F=A\left(E / B^{1 / 2}\right)^{-(1+\gamma) / \gamma} \delta\left(\mu-\mu_{0}\right)=A r^{-2}\left(E_{\text {local }}\right)^{-(1+\gamma) / \gamma}, \quad A=\text { const. }\right] \end{gathered}P=γρ=γ21+6γ+γ212πr2,e2A=(12m/r)1=(1+6γ+γ2)/(1+γ)2,e2Φ=Br4γ/(1+γ),B= const; F=A(E/B1/2)(1+γ)/γδ(μμ0)=Ar2(Elocal )(1+γ)/γ,A= const. ]

Exercise 25.29. CLUSTER WITH CIRCULAR ORBITS

What must be the form of the distribution function to guarantee that all stars move in circular orbits? Specialize the equations of structure to this case. Analyze the stability of the orbits of individual stars in the cluster, using an effective-potential diagram. What conditions must the distribution function satisfy if all orbits are to be stable? [See Einstein (1939), Zapolsky (1968).]

Box 25.8 EQUATIONS OF STRUCTURE FOR A SPHERICAL STAR CLUSTER

A. To Build a Model for a Star Cluster, Proceed as Follows

  1. Specify the distribution function = F ( E , L , μ ) = F ( E , L , μ ) =F(E,L,mu)\mathscr{\mathscr { ~ }}=F(E, L, \mu) =F(E,L,μ), where
E = energy-at-infinity of a star L = angular momentum of a star, μ = rest mass of a star. E =  energy-at-infinity of a star  L =  angular momentum of a star,  μ =  rest mass of a star.  {:[E=" energy-at-infinity of a star "],[L=" angular momentum of a star, "],[mu=" rest mass of a star. "]:}\begin{aligned} & E=\text { energy-at-infinity of a star } \\ & L=\text { angular momentum of a star, } \\ & \mu=\text { rest mass of a star. } \end{aligned}E= energy-at-infinity of a star L= angular momentum of a star, μ= rest mass of a star. 
  1. Solve the following two integro-differential equations for the metric functions m = 1 2 r ( 1 e 2 Λ ) m = 1 2 r 1 e 2 Λ m=(1)/(2)r(1-e^(-2Lambda))m=\frac{1}{2} r\left(1-e^{-2 \Lambda}\right)m=12r(1e2Λ) and Φ Φ Phi\PhiΦ of the line element
Box 25.8 (continued)
d s 2 = e 2 Φ d t 2 + e 2 Λ d r 2 + r 2 d Ω 2 m = 0 r 4 π r 2 ρ d r d Φ d r = m + 4 π r 3 P r r ( r 2 m ) d s 2 = e 2 Φ d t 2 + e 2 Λ d r 2 + r 2 d Ω 2 m = 0 r 4 π r 2 ρ d r d Φ d r = m + 4 π r 3 P r r ( r 2 m ) {:[ds^(2)=-e^(2Phi)dt^(2)+e^(2Lambda)dr^(2)+r^(2)dOmega^(2)],[m=int_(0)^(r)4pir^(2)rho dr],[(d Phi)/(dr)=(m+4pir^(3)P_(r))/(r(r-2m))]:}\begin{gathered} d s^{2}=-e^{2 \Phi} d t^{2}+e^{2 \Lambda} d r^{2}+r^{2} d \Omega^{2} \\ m=\int_{0}^{r} 4 \pi r^{2} \rho d r \\ \frac{d \Phi}{d r}=\frac{m+4 \pi r^{3} P_{r}}{r(r-2 m)} \end{gathered}ds2=e2Φdt2+e2Λdr2+r2dΩ2m=0r4πr2ρdrdΦdr=m+4πr3Prr(r2m)
where
ρ = 4 π F ( e ϕ p 0 ^ , r p r ^ , μ ) [ ( p o ^ ) 2 p r ^ / p r ^ ] d p r ^ d p 0 ^ d μ , P T = 2 π F ( e Φ p 0 ^ , r p r ^ , μ ) [ ( p r ^ ) 3 / p r ^ ] d p r ^ d p 0 ^ d μ , P r = 4 π F ( e ϕ p 0 ^ , r p r ^ , μ ) ( p r ^ p r ^ ) d p r ^ d p ^ ^ d μ , p r ^ = [ ( p o ^ ) 2 ( p r ^ ) 2 μ 2 ] 1 / 2 . ρ = 4 π F e ϕ p 0 ^ , r p r ^ , μ p o ^ 2 p r ^ / p r ^ d p r ^ d p 0 ^ d μ , P T = 2 π F e Φ p 0 ^ , r p r ^ , μ p r ^ 3 / p r ^ d p r ^ d p 0 ^ d μ , P r = 4 π F e ϕ p 0 ^ , r p r ^ , μ p r ^ p r ^ d p r ^ d p ^ ^ d μ , p r ^ = p o ^ 2 p r ^ 2 μ 2 1 / 2 . {:[rho=4pi int F(e^(phi)p^( hat(0)),rp^( hat(r)),mu)[(p^( hat(o)))^(2)p^( hat(r))//p^( hat(r))]dp^( hat(r))dp^( hat(0))d mu","],[P_(T)=2pi int F(e^(Phi)p^( hat(0)),rp^( hat(r)),mu)[(p^( hat(r)))^(3)//p^( hat(r))]dp^( hat(r))dp^( hat(0))d mu","],[P_(r)=4pi int F(e^(phi)p^( hat(0)),rp^( hat(r)),mu)(p^( hat(r))p^( hat(r)))dp^( hat(r))dp^( hat(hat()))d mu","],[p^( hat(r))=[(p^( hat(o)))^(2)-(p^( hat(r)))^(2)-mu^(2)]^(1//2).]:}\begin{gathered} \rho=4 \pi \int F\left(e^{\phi} p^{\hat{0}}, r p^{\hat{r}}, \mu\right)\left[\left(p^{\hat{o}}\right)^{2} p^{\hat{r}} / p^{\hat{r}}\right] d p^{\hat{r}} d p^{\hat{0}} d \mu, \\ P_{T}=2 \pi \int F\left(e^{\Phi} p^{\hat{0}}, r p^{\hat{r}}, \mu\right)\left[\left(p^{\hat{r}}\right)^{3} / p^{\hat{r}}\right] d p^{\hat{r}} d p^{\hat{0}} d \mu, \\ P_{r}=4 \pi \int F\left(e^{\phi} p^{\hat{0}}, r p^{\hat{r}}, \mu\right)\left(p^{\hat{r}} p^{\hat{r}}\right) d p^{\hat{r}} d p^{\hat{\hat{}}} d \mu, \\ p^{\hat{r}}=\left[\left(p^{\hat{o}}\right)^{2}-\left(p^{\hat{r}}\right)^{2}-\mu^{2}\right]^{1 / 2} . \end{gathered}ρ=4πF(eϕp0^,rpr^,μ)[(po^)2pr^/pr^]dpr^dp0^dμ,PT=2πF(eΦp0^,rpr^,μ)[(pr^)3/pr^]dpr^dp0^dμ,Pr=4πF(eϕp0^,rpr^,μ)(pr^pr^)dpr^dp^^dμ,pr^=[(po^)2(pr^)2μ2]1/2.
The integrations for ρ , P T ρ , P T rho,P_(T)\rho, P_{T}ρ,PT, and P r P r P_(r)P_{r}Pr go over all positive p r ^ , p 0 ^ , μ p r ^ , p 0 ^ , μ p^( hat(r)),p^( hat(0)),mup^{\hat{r}}, p^{\hat{0}}, \mupr^,p0^,μ for which ( p δ ^ ) 2 ( p T ^ ) 2 μ 2 0 p δ ^ 2 p T ^ 2 μ 2 0 (p^( hat(delta)))^(2)-(p^( hat(T)))^(2)-mu^(2) >= 0\left(p^{\hat{\delta}}\right)^{2}-\left(p^{\hat{T}}\right)^{2}-\mu^{2} \geq 0(pδ^)2(pT^)2μ20.

B. If the Distribution Function is Independent

of Angular Momentum, Then
  1. F = F ( E , μ ) F = F ( E , μ ) F=F(E,mu)F=F(E, \mu)F=F(E,μ).
  2. The distribution of stellar velocities at each point in the cluster is isotropic.
  3. ρ = 4 π F ( e ϕ p θ ^ , μ ) [ ( p θ ^ ) 2 μ 2 ] 1 / 2 ( p θ ^ ) 2 d p θ ^ d μ ρ = 4 π F e ϕ p θ ^ , μ p θ ^ 2 μ 2 1 / 2 p θ ^ 2 d p θ ^ d μ rho=4pi int F(e^(phi)p^( hat(theta)),mu)[(p^( hat(theta)))^(2)-mu^(2)]^(1//2)(p^( hat(theta)))^(2)dp^( hat(theta))d mu\rho=4 \pi \int F\left(e^{\phi} p^{\hat{\theta}}, \mu\right)\left[\left(p^{\hat{\theta}}\right)^{2}-\mu^{2}\right]^{1 / 2}\left(p^{\hat{\theta}}\right)^{2} d p^{\hat{\theta}} d \muρ=4πF(eϕpθ^,μ)[(pθ^)2μ2]1/2(pθ^)2dpθ^dμ.
  4. The pressure is isotropic:
P r = P T P 4 π 3 F ( e ϕ p 0 ^ , μ ) ( p 0 ^ 2 μ 2 ) 3 / 2 d p 0 ^ d μ P r = P T P 4 π 3 F e ϕ p 0 ^ , μ p 0 ^ 2 μ 2 3 / 2 d p 0 ^ d μ P_(r)=P_(T)-=P-=(4pi)/(3)int F(e^(phi)p^( hat(0)),mu)(p^( hat(0)2)-mu^(2))^(3//2)dp^( hat(0))d muP_{r}=P_{T} \equiv P \equiv \frac{4 \pi}{3} \int F\left(e^{\phi} p^{\hat{0}}, \mu\right)\left(p^{\hat{0} 2}-\mu^{2}\right)^{3 / 2} d p^{\hat{0}} d \muPr=PTP4π3F(eϕp0^,μ)(p0^2μ2)3/2dp0^dμ
  1. The total density of mass-energy ρ ρ rho\rhoρ, the pressure P P PPP, and the metric functions ϕ ϕ phi\phiϕ and m = 1 2 r ( 1 e 2 Λ ) m = 1 2 r 1 e 2 Λ m=(1)/(2)r(1-e^(-2Lambda))m=\frac{1}{2} r\left(1-e^{-2 \Lambda}\right)m=12r(1e2Λ) satisfy the equations of structure for a gas sphere ("star"),
m = 4 π r 2 ρ d r d Φ d r = m + 4 π r 3 P r ( r 2 m ) , d P d r = ( ρ + P ) ( m + 4 π r 3 P ) r ( r 2 m ) . m = 4 π r 2 ρ d r d Φ d r = m + 4 π r 3 P r ( r 2 m ) , d P d r = ( ρ + P ) m + 4 π r 3 P r ( r 2 m ) . {:[m=int4pir^(2)rho dr],[(d Phi)/(dr)=(m+4pir^(3)P)/(r(r-2m))","],[(dP)/(dr)=-((rho+P)(m+4pir^(3)P))/(r(r-2m)).]:}\begin{gathered} m=\int 4 \pi r^{2} \rho d r \\ \frac{d \Phi}{d r}=\frac{m+4 \pi r^{3} P}{r(r-2 m)}, \\ \frac{d P}{d r}=-\frac{(\rho+P)\left(m+4 \pi r^{3} P\right)}{r(r-2 m)} . \end{gathered}m=4πr2ρdrdΦdr=m+4πr3Pr(r2m),dPdr=(ρ+P)(m+4πr3P)r(r2m).
  1. Thus, to every static, spherical star cluster with isotropic velocity distribution, there corresponds a unique gas sphere that has the same distributions of ρ , P , m ρ , P , m rho,P,m\rho, P, mρ,P,m, and Φ Φ Phi\PhiΦ.
  2. Conversely [see Fackerell (1968)], given a gas sphere (solution to equations of stellar structure for ρ , P , m ρ , P , m rho,P,m\rho, P, mρ,P,m, and Φ Φ Phi\PhiΦ ), one can always find a distribution function F ( E , μ ) F ( E , μ ) F(E,mu)F(E, \mu)F(E,μ) that describes a cluster with the same ρ , P , m ρ , P , m rho,P,m\rho, P, mρ,P,m, and Φ Φ Phi\PhiΦ. But for some gas spheres F F FFF is necessarily negative in part of phase space, and is thus unphysical.

Box 25.9 ISOTHERMAL STAR CLUSTERS

A. Distribution Function

  1. In any relativistic star cluster, one might expect that occasional close encounters between stars would "thermalize" the stellar distribution function. This suggests that one study isotropic, spherical clusters with the Boltzmann distribution function (tacitly assumed zero for p 0 ^ = E e ϕ < μ 0 p 0 ^ = E e ϕ < μ 0 p^( hat(0))=Ee^(-phi) < mu_(0)p^{\hat{0}}=E e^{-\phi}<\mu_{0}p0^=Eeϕ<μ0 )
(1) ϰ = F ( E , L , μ ) = K e E / T δ ( μ μ 0 ) . (1) ϰ = F ( E , L , μ ) = K e E / T δ μ μ 0 . {:(1)ϰ=F(E","L","mu)=Ke^(-E//T)delta(mu-mu_(0)).:}\begin{equation*} \mathscr{\varkappa}=F(E, L, \mu)=K e^{-E / T} \delta\left(\mu-\mu_{0}\right) . \tag{1} \end{equation*}(1)ϰ=F(E,L,μ)=KeE/Tδ(μμ0).
Here K K KKK is a normalization constant, T T TTT is a constant "temperature," and for simplicity the stars are all assumed to have the same rest mass μ 0 μ 0 mu_(0)\mu_{0}μ0.
2. In such a cluster, an observer at radius r r rrr sees a star of energy-at-infinity E E EEE to have locally measured energy
(2) p 0 ^ = ( rest mass-energy ) + ( kinetic energy ) = μ 0 ( 1 v 2 ) 1 / 2 = E e Φ ( r ) . (2) p 0 ^ = (  rest mass-energy  ) + (  kinetic energy  ) = μ 0 1 v 2 1 / 2 = E e Φ ( r ) . {:(2)p^( hat(0))=(" rest mass-energy ")+(" kinetic energy ")=(mu_(0))/((1-v^(2))^(1//2))=Ee^(-Phi(r)).:}\begin{equation*} p^{\hat{0}}=(\text { rest mass-energy })+(\text { kinetic energy })=\frac{\mu_{0}}{\left(1-v^{2}\right)^{1 / 2}}=E e^{-\Phi(r)} . \tag{2} \end{equation*}(2)p0^=( rest mass-energy )+( kinetic energy )=μ0(1v2)1/2=EeΦ(r).
Consequently, the stars in his neighborhood have a Boltzmann distribution
(3) d N d 3 p ^ d 3 x ^ d μ = π = K exp ( p o ^ / T loc ) δ ( μ μ 0 ) (3) d N d 3 p ^ d 3 x ^ d μ = π = K exp p o ^ / T loc δ μ μ 0 {:(3)(dN)/(d^(3)( hat(p))d^(3)( hat(x))d mu)=pi=K exp(-p^( hat(o))//T_(loc))delta(mu-mu_(0)):}\begin{equation*} \frac{d N}{d^{3} \hat{p} d^{3} \hat{x} d \mu}=\mathscr{\pi}=K \exp \left(-p^{\hat{o}} / T_{\mathrm{loc}}\right) \delta\left(\mu-\mu_{0}\right) \tag{3} \end{equation*}(3)dNd3p^d3x^dμ=π=Kexp(po^/Tloc)δ(μμ0)
with locally measured temperature
(4) T loc ( r ) = T e ϕ ( r ) . (4) T loc ( r ) = T e ϕ ( r ) . {:(4)T_(loc)(r)=Te^(-phi(r)).:}\begin{equation*} T_{\mathrm{loc}}(r)=T e^{-\phi(r)} . \tag{4} \end{equation*}(4)Tloc(r)=Teϕ(r).
Thus, the temperature of the cluster is subject to identically the same red-shift-blueshift effects as photons, particles, and stars that move about in the cluster. (For a derivation of this same temperature-redshift law for a gas in thermal equilibrium, see part (e) of exercise 22.7.)
3. Actually, the Boltzmann distribution (1) can never be achieved. Stars with E > μ 0 E > μ 0 E > mu_(0)E>\mu_{0}E>μ0 are gravitationally unbound from the cluster and will escape. The Boltzmann distribution presumes that, as such stars go zooming off toward r = r = r=oor=\inftyr=, an equal number of stars with the same energies come zooming in from r = r = r=oor=\inftyr= to maintain an unchanged distribution function. Such a situation is clearly unrealistic. Instead, one expects the escape of stars to truncate the distribution at some energy E max E max E_(max)E_{\max }Emax slightly less than μ 0 μ 0 mu_(0)\mu_{0}μ0. The result, in idealized form, is the "truncated Boltzmann distribution"
(5) ϰ = F ( E , L , μ ) = { K e E / T δ ( μ μ 0 ) , E < E max , 0 , E > E max (5) ϰ = F ( E , L , μ ) = K e E / T δ μ μ 0 , E < E max , 0 , E > E max {:(5)ϰ=F(E","L","mu)={[Ke^(-E//T)delta(mu-mu_(0))",",E < E_(max)","],[0",",E > E_(max)]:}:}\mathscr{\varkappa}=F(E, L, \mu)=\left\{\begin{array}{cc} K e^{-E / T} \delta\left(\mu-\mu_{0}\right), & E<E_{\max }, \tag{5}\\ 0, & E>E_{\max } \end{array}\right.(5)ϰ=F(E,L,μ)={KeE/Tδ(μμ0),E<Emax,0,E>Emax
Box 25.9 (continued)

B. Structure and Stability of Cluster Models

  1. Models for star clusters with truncated Boltzmann distributions have been constructed by Zel'dovich and Podurets (1965), by Fackerell (1966), and by Ipser (1969), using the procedure of Box 25.8. Ipser has analyzed the collisionless radial vibrations of such clusters.
  2. In general, these clusters form a 4-parameter family ( K , T , μ 0 , E max K , T , μ 0 , E max  K,T,mu_(0),E_("max ")K, T, \mu_{0}, E_{\text {max }}K,T,μ0,Emax  ). Replace the parameter K K KKK by the total rest mass of the cluster, M 0 = μ 0 N M 0 = μ 0 N M_(0)=mu_(0)NM_{0}=\mu_{0} NM0=μ0N, where N N NNN is the total number of stars. Replace T T TTT by the temperature per unit rest mass, T ~ = T / μ 0 T ~ = T / μ 0 widetilde(T)=T//mu_(0)\widetilde{T}=T / \mu_{0}T~=T/μ0. Replace E max E max E_(max)E_{\max }Emax by the maximum energy per unit rest mass, E ~ max = E max / μ 0 E ~ max  = E max / μ 0 widetilde(E)_("max ")=E_(max)//mu_(0)\widetilde{E}_{\text {max }}=E_{\max } / \mu_{0}E~max =Emax/μ0. Then the clusters are parametrized by ( M 0 , T ~ , μ 0 , E ~ max ) M 0 , T ~ , μ 0 , E ~ max  (M_(0),( widetilde(T)),mu_(0), widetilde(E)_("max "))\left(M_{0}, \widetilde{T}, \mu_{0}, \widetilde{E}_{\text {max }}\right)(M0,T~,μ0,E~max ). When one now doubles μ 0 μ 0 mu_(0)\mu_{0}μ0, holding M 0 , T ~ , E ~ max M 0 , T ~ , E ~ max M_(0), widetilde(T), widetilde(E)_(max)M_{0}, \widetilde{T}, \widetilde{E}_{\max }M0,T~,E~max fixed (and thus halving the total number of stars), all macroscopic features of the cluster remain unchanged. In this sense μ 0 μ 0 mu_(0)\mu_{0}μ0 is a "trivial parameter" and can henceforth be ignored or changed at will. The total rest mass of the cluster M 0 M 0 M_(0)M_{0}M0 can be regarded as a "scaling factor"; all dimensionless features of the cluster are independent of it. For example, if ρ c ρ c rho_(c)\rho_{c}ρc is the central density of mass-energy [equation (25.81a), evaluated at r = 0 r = 0 r=0r=0r=0 ], then ρ c M 0 2 ρ c M 0 2 rho_(c)M_(0)^(2)\rho_{c} M_{0}{ }^{2}ρcM02 is dimensionless and is thus independent of M 0 M 0 M_(0)M_{0}M0, which means that ρ c M 0 2 ρ c M 0 2 rho_(c)propM_(0)^(-2)\rho_{c} \propto M_{0}{ }^{-2}ρcM02. Only two nontrivial parameters remain: T ~ T ~ widetilde(T)\widetilde{T}T~ and E ~ max E ~ max widetilde(E)_(max)\widetilde{E}_{\max }E~max.
  3. Consider as an instructive special case [Zel'dovich and Podurets (1965)] the one-parameter sequence with E ~ max = 1 1 2 T ~ E ~ max = 1 1 2 T ~ widetilde(E)_(max)=1-(1)/(2) widetilde(T)\widetilde{E}_{\max }=1-\frac{1}{2} \widetilde{T}E~max=112T~. The following figure, computed by Ipser (1969), plots for this sequence the fractional binding energy,
(6) E bind / M 0 ( M 0 M ) / M 0 (6) E bind  / M 0 M 0 M / M 0 {:(6)E_("bind ")//M_(0)-=(M_(0)-M)//M_(0):}\begin{equation*} E_{\text {bind }} / M_{0} \equiv\left(M_{0}-M\right) / M_{0} \tag{6} \end{equation*}(6)Ebind /M0(M0M)/M0
(here M M MMM is total mass-energy); the square of the angular frequency for collisionless vibrations (vibration amplitude e i ω t e i ω t prope^(-i omega t)\propto e^{-i \omega t}eiωt ) divided by central density of mass-energy, ω 2 / ρ c ω 2 / ρ c omega^(2)//rho_(c)\omega^{2} / \rho_{c}ω2/ρc; and the redshift, z c z c z_(c)z_{c}zc, of photons emitted from the center of the cluster and received at infinity. All these quantities are dimensionless, and thus depend only on the choice of T ~ = T / μ 0 T ~ = T / μ 0 widetilde(T)=T//mu_(0)\widetilde{T}=T / \mu_{0}T~=T/μ0.
4. Notice that all models beyond the point of maximum binding energy ( z c 0.5 ) z c 0.5 (z_(c) >= 0.5)\left(z_{c} \geq 0.5\right)(zc0.5) are unstable against collisionless radial perturbations ( ω ω omega\omegaω imaginary; amplitude of perturbation e | ω | t ) e | ω | t {: prope^(|omega|t))\left.\propto e^{|\omega| t}\right)e|ω|t). When perturbed slightly, such clusters must collapse to form black holes. (See Chapter 26 for an analysis of the analogous instability in stars).
5. These results suggest an idealized story of the evolution of a spherical cluster [Zel'dovich and Podurets (1965); Fackerell, Ipser, and Thorne (1969)]. The

cluster would evolve quasistatically along a sequence of spherical equilibrium configurations such as those of the figure. The evolution would be driven by stellar collisions and by the evaporation of stars. When two stars collide and coalesce, they increase the cluster's rest mass and hence its fractional binding energy. When a star gains enough energy from such encounters to escape from the cluster, it carries away excess kinetic energy, leaving the cluster more tightly bound. Thus, both collisions and evaporation should drive the cluster toward states of tighter and tighter binding. When the cluster reaches the point, along its sequence, of maximum fractional binding energy, it can no longer evolve quasistatically. Relativistic gravitational collapse sets in: the stars spiral inward through the gravitational radius of the cluster toward its center, leaving behind a black hole with, perhaps, some remaining stars orbiting it.
It is tempting to speculate that violent events in the nuclei of some galaxies and in quasars might be associated with the onset of such a collapse, or with encounters between an already collapsed cluster (black hole) and surrounding stars.

  1. *For more detailed treatments of this subject see, e.g., Stueckelberg and Wanders (1953), Kluitenberg and de Groot (1954), Meixner and Reik (1959), and references cited therein; see also the references on hydrodynamics cited at the beginning of $ 22.3 $ 22.3 $22.3\$ 22.3$22.3, and the references on kinetic theory cited at the beginning of $ 22.6 $ 22.6 $22.6\$ 22.6$22.6.
  2. *For more detailed treatments of this subject see, e.g., Ehlers (1961), Taub (1971), Ellis (1971), Lichnerowicz (1967), Cattaneo (1971), and references cited therein; see also the references on kinetic theory cited at the beginning of $ 22.6 $ 22.6 $22.6\$ 22.6$22.6.
  3. *Exercise supplied by John M. Stewart.
    • Based in part on notes prepared by William L. Burke at Caltech in 1968. For more detailed treatments of geometric optics in curved spacetime, see, e.g., Sachs (1961), Jordan, Ehlers, and Sachs (1961), and Robinson (1961); also references discussed and listed in $ 41.11 $ 41.11 $41.11\$ 41.11$41.11.
  4. *The equations for A A A\boldsymbol{A}A are linear. Therefore the analysis would proceed equally well assuming, instead of an amplitude independent of λ λ lambda\lambdaλ, a dominant term a λ n a λ n a proplambda^(n)\boldsymbol{a} \propto \lambda^{n}aλn, with b λ n + 1 , c λ n + 2 b λ n + 1 , c λ n + 2 b proplambda^(n+1),c proplambda^(n+2)\boldsymbol{b} \propto \lambda^{n+1}, \boldsymbol{c} \propto \lambda^{n+2}bλn+1,cλn+2, etc. The results are independent of n n nnn. Choosing n = 1 n = 1 n=1n=1n=1 would give field strengths F μ ν F μ ν F_(mu nu)F_{\mu \nu}Fμν and energy densities T μ ν F 2 T μ ν F 2 T_(mu nu)propF^(2)propT_{\mu \nu} \propto F^{2} \proptoTμνF2 A 2 / λ 2 A 2 / λ 2 A^(2)//lambda^(2)propA^{2} / \lambda^{2} \proptoA2/λ2 constant as λ 0 λ 0 lambda longrightarrow0\lambda \longrightarrow 0λ0.
  5. *For more detailed and sophisticated treatments of this topic, see, e.g., Tauber and Weinberg (1961), and Lindquist (1966), Marle (1969), Ehlers (1971), Stewart (1971), Israel (1972), and references cited therein. Ehlers (1971) is a particularly good introductory review article.
    • Of course, equation (23.5) only succeeds in defining a new time coordinate t t t^(')t^{\prime}t if it is integrable as a differential equation for t t t^(')t^{\prime}t. By choosing the integrating factor e ϕ e ϕ e^(phi)e^{\phi}eϕ to be just e ϕ = a ( r ) e ϕ = a ( r ) e^(phi)=a(r)e^{\phi}=a(r)eϕ=a(r), one sees that t = t + [ b ( r ) / a ( r ) ] d r t = t + [ b ( r ) / a ( r ) ] d r t^(')=t+int[b(r)//a(r)]drt^{\prime}=t+\int[b(r) / a(r)] d rt=t+[b(r)/a(r)]dr is the integral of (23.5); thus the required t t t^(')t^{\prime}t coordinate always exists, no matter what the functions a ( r ) , b ( r ) , c ( r ) a ( r ) , b ( r ) , c ( r ) a(r),b(r),c(r)a(r), b(r), c(r)a(r),b(r),c(r), and R ( r ) R ( r ) R(r)R(r)R(r) in equation (23.4) may be.
  6. *Historical note: Wilhelm K. J. Killing, born May 10, 1847, in Burbach, Westphalia, died February 11, 1923 in Münster, Westphalia; Professor of Mathematics at the University of Münster, 1892-1920. The key article that gives the name "Killing vector" to the kind of isometries considered here appeared almost a century ago [Killing (1892)].
  7. *These equations were first derived and explored by Zel'dovich and Podurets (1965).